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The  problem  of  establishing  the  effectiveness  of  an 
information  system  is  considered.  An  effectiveness 
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1.  THE  RATIONALE  FOR  ADOPTING  THE  VIEWPOINT  OF 

STATISTICAL  DECISION  THEORY  IN  EVALUATING 
INFORMATION  SYSTEMS 


1 .  1  The  Design  Process 

A  system  design  in  its  final  form  is  the  result  of  an  aggregate  of 
design  decisions  concerning  the  specifications  of  its  components  and  the 
structure  of  their  connections. 

The  information  concerning  the  merits  of  available  design  alterna¬ 
tives  is  obtained  either  through  logical  implications  of  a  performance  model 
for  the  family  of  systems  in  question  or  through  experimental  or  historical 
evidence  which  indicates  which  system  configurations  have  been  successful 
in  the  past.  A  case  in  which  design  relied  quite  heavily  on  the  experimental 
approach  is  represented  by  the  design  of  modern  automobiles.  The  deter¬ 
minations  of  increasingly  satisfactory  power  ratings,  number  and  geometry 
of  the  piston  assembly,  stroke  and  bore  dimensions,  compression  ratio, 
type  of  cycle,  etc.  ,  all  have  resulted  from  a  long  history  of  testing  alterna¬ 
tive  designs. 

In  the  case  of  the  engineering  of  information  systems,  reliance  on 
such  an  experimental  weeding  of  unsatisfactory  system's  designs  is  not 
altogether  desirable  for  the  following  reasons: 

Information  systems  tend  to  be  strongly  differentiated, 
sometimes  to  the  point  of  being  one  of  a  kind. 

Information  systems  testing  can  be  very  expensive  or, 
in  the  military,  sometimes  not  feasible. 

The  cost  resulting  from  design  errors  can  be  very  high. 

Information  systems  become  obsolete  too  rapidly  for  the 
trial  and  error  approach. 

If  in  "vivo**  testing  by  experiments  is  to  be  avoided,  then  one  is  left 
with  the  alternative  of  acquiring  the  knowledge  concerning  various  designs 
by  modeling.  Briefly,  a  model  of  a  system  characterizes,  in  the  light  of  a 
given  set  of  design  assumptions,  the  transformation  of  the  inputs  of  the 
system  into  its  outputs  and  thereby  arrives  at  the  computation  of  a  per¬ 
formance  measure  for  the  system. 


The  Value  of  Information 


1.  2 


Information  in  general  denotes  a  set  of  potential  messages  associated 
with  a  given  channel  or  system  of  information.  It  can  be  viewed  as  some¬ 
thing  which  informs  us  about  the  state  of  a  certain  environment  so  that  the 
uncertainty  associated  with  such  an  environment  can  be  expected  to  be 
reduced  if  not  completely  eliminated.  Our  desire  for  the  reduction  of 
uncertainty  concerning  the  environment  can  be  based  on  either  purely 
intellectual  motivations  (e.  g.  .  to  satisfy  the  curiosity  of  the  mind)  or  the 
need  for  making  decisions  with  the  hope  of  achieving  some  economic  goals. 

In  situations  where  the  economic  motivation  is  a  dominating  factor,  the 
value  of  an  information  system  has  to  be  assessed  by  figuring  out  how  much 
we  expect  it  to  help  us  to  attain  our  goals.  Since  information  is  the  output 
of  an  information  system,  in  order  to  determine  the  value  associated  with 
an  information  system,  we  have  to  know  the.  nature  of  our  decision  task  and 
how  the  provided  information  is  utilized  in  the  decision  process. 


1 .  3  The  Inadequacy  of  Applying  the  Traditional  Information  Theory 
in  Evaluating  Information  Systems 

Let  Z  be  a  random  variable  that  assumes  N  values,  denoted  by 
zi  .  .  .  respectively;  and  let  be  the  probability  of  z;.  Then  a 

statistical  parameter  called  the  **entropy'*  associated  with  the  random 
variable  Z,  denoted  by  H{  Z),  is  defined  as 

H(Z)  =  P(z.)  .  log  P{zJ 

1  3  1 

When  the  z's  represent  the  various  sta.tes  of  na.ture,  the  entropy  measures, 
in  some  sense,  the  degree  of  uncertainty  associated  with  the  na.ture.  When 
the  z*s  represent  the  set  of  potential  messa.ges  associated  with  a.n  informa¬ 
tion  channel,  it  is  used  by  Shannon'*'  as  a  measure  of  **the  amount  of  infor¬ 
mation.  ** 


Let  X  =  (x^  .  .  .  xjs^)  denote  the  set  of  N  possible  states  of  nature, 
and  Y  =  (yj  .  .  .  y^^)  denote  the  set  of  potential  messages  associated  witji 
an  information  channel.  Let  H(X  jyj  )  =  -  E  P(x^  lyj)  •  log  P(x-  ly^;  where 

P(xi  |yj)  is  the  conditional  probability  of  x;  given  yj.  H(xlyj)  can  be 

Shannon,  C.  Mathematica.l  Theory  of  Communication,  **  Bell  System 

Technical  Journal,  1948. 
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interpreted  as  a  measure  of  the  amount  cf  uncertainty  (about  the  nature) 
remaining  after  receiving  the  j  -'th  message.  Then  the  equivocation  of  the 
channel  Y  with  respect  to  the  source  X,  denoted  by  H(x1y),  is  defined  as 

H(XIY)  =  E  P(y.)  •  H(x|y,) 

j  J  J 

The  transmission  rate  of  Y  with  respect  to  X,  denoted  by  R(Y,  X),  is 
defined  as 


R(Y,,  X)  =  H(X)  -  H(X|Y) 

and  the  channel  capacity,  denoted  by  C(Y),  is  defined  as 

C{Y)  =  Max  R(Y,  X) 

X 

Since  the  derivation  of  the  channel  capacity  (and  that  of  transmission 
rate,  etc.  )  does  not  take  into  consideration  the  user's  utility  structure  any  - 
where,  it  is  clear  that  it  cannot  adequately  represent  the  value  of  an  infor¬ 
mation  channel  to  a  particular  user  faced  with  a  particular  decision  task  -- 
although  under  certain  special  assumptions  concerning  the  user's  utility 
structure,  the  channel  capacity  can  be  made  to  correspond  to  the  value  of 
an  information  channel  to  the  user,  * 

What  then  does  the  entropy  associated  with  an  information  channel 
represent?  According  to  Marschak,  since  the  entropy  usually  increases 
with  the  number  of  distinct  potential  messages,  and  the  larger  the  number 
of  distinct  potential  messages,  the  larger  th.e  number  of  symbols  needed  at 
a  minimum  to  distinguish  the  messages,  the  entropy  represents,  more 
appropriately,  the  cost  of  constructing  an  information  instrument.  This 
perhaps  explains,  at  least  partially,  the  fact  that  the  entropy  concept  was 
first  proposed  by  the  people  at  Bell  Lab.  A  producer  of  information 
instruments  cannot  hope  to  take  into  consideration  the  various  needs  of 
different  users  of  its  product;  however,  he  does  concern  himself  about  it 
costs  associated  with  producing  various  information  instruments. 


See  Kelly,  J.  **A  New  Interpretation  of  Information  Rate,  **  Bell  System 
Technical  Journal,  1956. 

Marschak,  J.  "Remarks  on  the  Economics  of  Information,  "  Contribu¬ 
tions  to  Scientific  Research  in  Management,  I960. 
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1 .  4  Statistical  Decision  Theory 


Statistical  decision  theory  is  concerned  with  the  derivation  of  an 
optimal  decision  rule  (in  the  face  of  uncertainty)  based  on 

The  decisionmaker's  utility  structure. 

The  decision  maker's  a  priori  probability  distribution 
over  the  states  of  nature. 

An  experiment  which  generates  observations. 

A  rule  of  revising  the  prior  distribution  upon 
receiving  an  observation. 

A  definition  of  what  constitutes  an  optimal  decision 
rule. 

In  deriving  such  an  optimal  decision  rule,  one  introduces  a  measure  of 
performance  over  the  set  of  all  possible  decision  rules  which  is  the 
expected  value  of  some  suitable  function  of  the  utilities.  A  natural  appli¬ 
cation  of  this  theory  in  evaluating  information  systems  is  to  model  the 
information  system  as  a  scattering  process  (an  experiment)  in  which  any 
particular  state  of  nature  can  give  rise  to  several  possible  messages 
(observations),  and  the  value  associated  with  an  information  system  is 
obtained  by  computing  the  expected  gain  (using  the  optimal  decision  rule) 
as  a  result  of  employing  the  system. 

It  then  follows  that  an  appropriate  measure  of  effectiveness  of  a 
given  information  system  to  a  particular  user  is  the  net  gain  in  the 
expected  utility,  resulting  from  employing  the  information  system  as  an 
aid  in  decision-making  processes  over  and  above  the  expected  utility 
which  results  when  no  information  system  is  employed. 

If  the  effectiveness  of  an  information  system  is  defined  in  the  man¬ 
ner  outlined  above,  it  will  be  logically  determined  by,  and  thus  consistent 
with,  the  utilities  chosen  and  the  statistical  hypotheses  concerning  the 
prior  uncertainty  about  the  environment. 

Since  effectivene ss- co st  equilibrium  will  determine  the  design 
recommendations,  we  ultimately  have  in  this  theory  a  means  to  make 
design  recommendations  which  are  consistent  with  the  aforementioned 
elements.  On  the  other  hand,  if  one  would  adopt,  following  a  common 
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practice,  measures  of  performance  of  components  as  the  basis  for  perform¬ 
ing  trade-off  analyses,  one  may  recommend  a  design  which  does  not  agree 
with  the  user  preference.  Such  design  may  in  fact  imply  a  utility  structure 
which  the  user  would  not  agree  with,  if  rendered  explicit  to  him.  Worse 
yet,  the  design  so  obtained  may  imply  mutually  contradictory  statements  of 
preference  so  that  there  is  no  utility  structure  which  is  consistent  with  it. 
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2. 


THE  CHOICE  OF  PREMISES 


The  method  developed  in  these  notes  allows  us  to  determine  the 
value  associated  with  a  set  of  design  specifications  for  an  information 
system  which  is  consistent  with  the  following  premises: 

.  The  utility  associated  with  an  act- state  pair  (A^,  X^)  is 
the  real  number  u(A^,  X^),  where  A^  ranges  over  all 
the  acts  open  to  the  decision  maker,  and  X^  ranges 
over  all  the  possible  states  of  the  environment. 

.  The  prior  probability  that  the  environment  will  be  in 
state  X^  is  the  probability  measure  P(X^), 

A  definition  of  what  constitutes  an  optimal  decision 
rule. 

Some  brief  comments  on  the  selection  of  particular  instances  of 
these  premises  are  in  order.  The  search  for  optimal  decision  rules 
necessarily  entails  the  establishment  of  a  criterion  of  optimality.  A 
criterion  of  optimality  can  be  established  by  specifying  a  particular 
objective  function,  in  our  case,  a  particular  utility  structure. 

An  objective  function,  in  general,  and  a  utility  structure,  in  par¬ 
ticular,  can  be  specified  in  the  following  ways: 

.  It  is  specified  by  a  ranking  scheme  among  all  the  possible 
outcomes  based  on  the  preference  of  the  decision  maker 
involved. 

It  is  specified  by  a  ranking  scheme  based  on  a  so-called 
"average  preference.  " 

It  is  specified  arbitrarily. 

The  last  possibility  can  be  ignored  because  of  its  attendant  risks  of 
irrelevancy  and  inconsistency.  The  second  possibility  can  be  elimin¬ 
ated  because  it  is  difficult,  in  general,  to  exhibit  an  "average  prefer¬ 
ence"  which  would  obtain  the  consensus  of  most  decision  makers  and 
because  the  experimental  survey  of  opinion  is  likely  to  be  expensive  or 
even  infeasible.  This  leaves  us  only  the  first  possibility.  Admittedly, 
the  ranking  scheme  then  involves  a  subjective  value  judgment  of  a  person 
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or  of  a  nonrepresentative  group"'"  and  thus  may  lack  universality  and  may  be 
highly  variable  with  time. 

The  lack  of  universality  is  immaterial  since  one  is  designing  a  sys¬ 
tem  for  a  given  user  who  may  not  agree,  or  who  even  may  have  to  disagree, 
with  some  other  user  due  to  his  different  goal  systems.  For  example,  the 
information  delivered  to  a  wing  commander  may  have  to  be  evaluated  in  a 
way  which  is  substantially  different  from  that  used  for  the  evaluation  of  the 
information  delivered  to  a  squadron  commander;  the  latter's  goal  of  destroy¬ 
ing  individual  enemy  fighters  may  be  only  a  subgoal  of  very  minor  importance 
for  the  former.  Actually,  the  point  can  be  made  that  one  of  the  uses  of  our 
method  is  to  verify  whether  the  utility  structures  of  two  distinct  users  are 
sufficiently  congruent  at  the  design  recommendation  level  to  warrant  a  joint 
design  and  sharing  of  costs,  or  whether  it  is  better  to  keep  two  separate 
systems  because  the  cost  of  compromising,  in  terms  of  lowered  perform¬ 
ance,  is  too  high. 

The  instability,  in  time,  of  the  value  structure  of  the  user  can  be 
reduced  by  proper  selection  of  the  decision-making  context^''*'  for  th 
evaluation,  but  it  cannot  be  eliminated  completely  because  of  factors  such 
as  developments  in  weapon,  sensing,  communication,  etc.  ,  technologies. 

In  any  event,  if  the  value  judgment  is  not  made  at  the  level  of  specifying 
the  utility  structure,  it  will  be  made  at  a  lower  level,  that  of  specifying 
the  system's  technical  characteristics  and,  as  said  before,  such  value 
judgments  are  equivalent  to  value  judgments  made  about  utilities  with  the 
added  disadvantage  that  the  various  system  parameter's  judgments  may 
not  be  mutually  consistent.  Since  the  method  translates  the  value  judg¬ 
ments  of  the  user  concerning  the  value  to  him  of  certain  actions  in  the 
face  of  certain  environmental  situations  into  recommendations  concerning 
the  technical  parameters  of  the  system,  it  has  the  added  advantage  of 
letting  the  user  make  the  value  judgment. 

What  is  meant  here  is  that  the  military  commander  is  more  famil¬ 
iar  with  his  world  and  act  variables  than  with  a  context  made  up  of  error 
rates,  channel  capacities,  rate  of  information  transfer,  storage  capaci¬ 
ties,  etc.  He  has,  then,  a  capability  of  making  an  informed  judgment 
within  the  decision-making  context,  but  may  have  no  real  feel  for  what 
it  is  worth  to  him  to  have  one  rather  than  another  set  of  specifications 
for  the  information  system,  especially  if  it  is  an  involved  system.  This 
in  turn  may  result  in  agreeing  to  design  recommendations,  which,  as 
repeatedly  pointed  out,  may  not  really  agree  with  the  decision  maker's 
preference. 


A  decision  maker  can  be  either  an  individual  or  a  group. 

**Toda,M.  and  Shuford,  E.  H.  ,  Jr.  TDK  No.  ESD-TDR-63-622,  Oct.  1963 
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Since  the  proposed  method  relies  heavily  on  automating  the  computa¬ 
tion  of  the  system  effectiveness,  it  would  permit  us  to  quickly  obtain  a  view 
of  the  effectiveness  of  many  alternative  system  configurations  in  the  light  of 
a  single  utility  and  prior  probability  premise.  This  would  in  turn  allow  com¬ 
parison  of  different  alternative  premises  to  test  the  stability  of  the  design 
recommendations  with  respect  to  changes  of  the  premises.  In  other  words, 
one  could  obtain  indications  of  the  more  critical  elements  in  the  premises. 

A  better  informed  judgment  on  the  part  of  the  user  could  then  be  obtained 
for  those  elements  of  either  the  utility  structure  or  the  prior  distribution, 
which  prove  to  be  most  critical. 

The  task  at  hand  can  be  considerably  reduced  if  one  considers  that: 

The  utility  structure  is,  in  general,  an  array  of  induced 
utilities'^  which  depend  on  fewer  absolute  utilities  and  a 
set  of  probabilities  of  obtaining  a  goal  given  that  a  given 
subgoal  has  been  achieved.  One  needs,  then,  to  test  the 
de sign- criticality  of  the  absolute  utilities.  (The  success 
probabilities  are,  in  general,  known.  ) 

The  prior  probabilities  may  be  expressed  in  terms  of  a 
limited  number  of  parameters  or  there  may  be  sufficient 
statistical  information  to  determine  them. 

Finally,  with  regard  to  the  selection  of  a  definition  of  an  optimal 
decision  rule,  we  can  say  that  in  this  report  we  will  adopt  a  Bayesian 
viewpoint,  i.  e.  ,  a  decision  rule  is  said  to  be  optimal  if  its  adoption 
maximizes  the  expected  utility,  using  as  weights  the  posterior  proba¬ 
bilities  which  are  computed  by  employing  Bayes  theorem. 


Toda,  M.  and  Shuford,  E.  H.  , 


Jr 


TDR  No.  ESD-TDR-63-622,  Oct  1963. 
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THE  BAYESIAN  EFFECTIVENESS  OF  INFORMATION  SYSTEMS 


3.  1  The  Bayesian  Value  of  an  Information  System 

The  usual  model  of  decision  making  under  uncertainty  assumes  that 
there  are  certain  states  of  nature  that  are  relevant  to  our  decision,  certain 
acts  that  are  open  to  us  for  choice,  and  a  utility  index  associated  with  each 
act- state  pair. 

Let  X'^  denote  the  j-th  state  of  nature,  j  =  1,  .  .  .  ,  N;  A^  denote  the  i-th 
act  open  to  us,  i  =  1,  .  .  .  ,  L;  and  ujj  be  the  utility  index  assigned  to  the  act- 
state  pair  (AS  X^),  u^j  =  U  (A^,  X'^). 

An  information  structure  can  be  most  conveniently  characterized  as 
follows’. 


1  t- 
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Y 
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IM 
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*NM 


where  Y  ,  k=  1,  .  .  .  ,  M,  is  the  k-th  message  transmitted  to  us  by  the  infor¬ 
mation  system,  and  =  P(Y^  jx^)  is  the  conditional  probability  of  the  k-th 
message  given  the  fact  that  the  true  state  of  the  nature  is  X^. 

A  rule  which  assigns  an  act  to  each  of  the  possible  messages  is  called 
a  decision  rule.  We  shall  denote  it  by  A  =  a(Y). 


The  Bayesian  decision  rule  assumes  the  following  things:  (1)  There  is 
a  certain  prior  probability  associated  with  each  state  of  nature;  we  shall 
denote  it  by  P(XJ),  j  =  1 . N.  (2)  For  each  message  observed,  an  a  pos¬ 

teriori  probability  distribution  over  the  states  of  nature  can  be  derived  by 
using  the  Bayes  theorem.  Let  P(xj  |Y^)  denote  the  .  posterior.,  probability 
of  XJ  given  the  fact  that  Y^  has  been  observed.  Then, 

P(X^lY^)  =  P(X’)  •  P(Y^|X^J^/9  •  P(Y^lxV 


-9 


(3)  Let  V(A*|Y  )  “  L  P{X'^|y  )  u..  be  the  expected  value  of  A  given  the  fact 
k  ^ 

that  Y^  has  been  observed.  Then  the  Bayesian  decision  rule  says  that  for 
each  message  Y^  one  should  select  the  act  A  =  6^(Y^)  such  that 

V[a(Y^)lY^]=  maxV(AlY^). 

A 

Let  P(Y^)  =  L  P(XJ)  •  P(Y^|X'^)  be  the  probability  of  observing  the  i-th 

j 

message  given  the  prior  probability  distribution  over  X  and  the  information 
system  x-  Then  the  Bayesian  value  of  x  is 

A  M  ;  N  :  :  ;  4 

V(x)  =  L  )  E  P(XJ|Y  )  u  [^(Y  ),  X^  ].  (3.  1-1) 

i=l  j=l 


3.  2  Some  Examples  of  Information  Systems 
3.  2.  1  Perfect  Information  System 


X 


1 


1  0 
0  1 


0 


0 

0 


if 

i  =  j 

■^^  =  0 

if 

i  ^  j 

The  perfect  information  system  is  interesting  because  it  serves 
as  a  least  upper  bound  for  the  Bayesian  value  of  an  information  system,  which 
is  a  measure  of  the  best  (according  to  the  Bayesian  criterion)  a  decision  maker 
can  do  with  the  aid  of  a  perfect  information  system.  Such  a  bound  will  tell  us 
what  the  maximum  gain  is  that  can  be  expected  by  improving  the  precision  of 
an  information  system  and  whether  an  effort  to  improve  the  information  justi¬ 
fies  its  cost. 
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3.  2.  2  Null  Information  System 


Y 


1 


Y 


M 


X 


1 


'll 


^IM 


q. .  =  q,  .  for  j  =  1,  .  .  .  ,  M. 
ij  kj 


The  null  information  system  is  interesting  because  it  serves 
as  a  greatest  lower  bound  for  the  Bayesian  value  of  an  information  system, 
which  is  a  measure  of  the  best  (according  to  the  Bayesian  criterion)  a 
decision  maker  can  do  without  the  aid  of  an  information  system. 


3.  2,  3  Imperfect  Information  System  as  a  Partition  of  X 

This  is  a  situation  where  the  information  system  serves  as 
a  scheme  to  partition  the  set  of  all  states  of  nature,  which  is  usually  the 
result  of  coding  the  states  of  nature  using  fewer  distinct  messages  than  the 
total  number  of  distinct  states  of  nature. 
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X 


1 


0  0 


0 


X 


N, 


0 


0 


xNl+1 


0 


X”2 


0 


0 


X 


Nz+l 


0 


0 


0  0  0 


q.  .=  1  if  N.,<i  <N. 

ij  J-1  "3 

q.  .  =  0  otherwise 
ij 

where  =  0 
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3.  3  The  Bayesian  Effectiveness 


As  defined  in  Section  3.  1,  the  Bayesian  decision  rule  selects  the  act 
A  =  6^(Yk)  such  that 

V[^(Y^)lYk]  =  max  V(AIy^) 

A 

i.  e.  ,  such  that 

N  .  .  N 

Z  P(X^|y^)  u  [^t  (Y  ),  ]  =  max  [  S  P(X^|Y  )  u  (A,  X^)  ] 

j=l  A  j=l 

Then  the  Bayesian  value  of  the  information  system  x  is: 

^  M  .  N  .  . 

V(x)  =  Z:  P(Y')max  [E  PIX^  |  y')  u  ( A,  X^)  ] 
i-1  A  j=l 


By  Bayes  theorem  we  have  that 


p(xVS  =  ^  ^ 


P(Yi) 


Thus 


^(x)  =  r  P(y")  max  [  E  ^  x^)]  = 

i=l  A  j=l  P(Y") 


M  N  .  . 

£  max  [  E  P(Y^|x‘’)u  (A,  X^)  ]  (3.3-1) 

i=l  A  j=l 

where  the  {P(Y^|xJ)  }  can  be  directly  specified  once  the  statistical  nature 
of  the  error  processes  in  the  information  system  is  known,  and  the  func¬ 
tions  ‘u(A,  XJ)  are  given  by 

u(A,  XJ)  =  u(A,  xj)  P(xj) 


This  suggests  that  the  effectiveness  of  an  information  system,  E(x)> 
may  be  considered  as  the  net  gain  in  the  expected  utility,  resulting  from 
employing  the  information  system  as  an  aid  in  decision-making  processes 
over  and  above  the  expected  utility  which  results  when  no  information  sys¬ 
tem  is  employed.  That  is: 
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(3.  3-2) 


E(x)  =  ^(x)  -  ^(Xq) 

where  Xq  null  information  system  defined  by 

P(xjlY^)  =  P(X^) 


for  all  i  and  j. 


The  value  of  V(Xq)  is  given  by: 


A 

V(Xo) 

II 

N 

P(Y^)  max  iL  P(X' 

i=l 

A  j=l 

N 

P(xj)  a  (A.  X^)  ]  = 

max 

[  r 

A 

N 

Tr(A.  xj)  ] 

max 

[  L 

A 

j=l 

(3.  3-3) 


Equations  ( 3.  3- 1),  (3.3-2),  and  ( 3.  3- 3)  then  allow  the  computation  of 
the  Bayesian  effectiveness  of  the  information  system  X  characterized  by  the 
“noise"  probabilities  {  P(Y^|X‘^)]  in  the  light  of  the  utility  functions 
{  u{A,  X*^)}  and  prior  distribution  {P(X*^)}. 

The  solution  of  equations  (3.  3-1)  and  (3.  3-3)  constitute  problems  of 
the  mathematical  programming  type.  In  fact,  in  both  cases  one  is  maxi¬ 
mizing  a  convex  combination  of  the  functions  u(A,  X*^).  Such  a  convex 
combination  (for  a  given  in  the  case  of  equation  (3.  3-1))  is  a  function 
of  A.  A  in  turn  can  be  limited  to  a  suitably  defined  region  of  an  n- 
dimensional  vector  space.  In  Appendix  A  we  present  a  special  case  of 
the  damage  assessment  function  which  results  in  a  nonlinear  program 
for  which  the  Kuhn- Tucker  conditions  generate  a  resolvent  algorithm. 
Whenever  it  is  possible  to  supply  an  efficient  resolvent  algorithm  for 
equations  (3.  3-1)  and  (3.  3-3),  one  is  then  in  the  position  to  evaluate  the 
effectiveness  of  the  information  system  using  equation  (3.  3-2). 

Unfortunately,  it  is  frequently  the  case  that  either  the  act  space 
or  the  utility  functions  are  such  that  no  efficient  algorithmic  solution 
exists  besides  the  exhaustion  of  alternatives. 
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In  the  following  section  we  shall  deal  with  the  case  in  which  the  con¬ 
tinuous  functions  of  a  {u(A,  X*^)]  are  replaced  by  the  columns  of  a  utility 
matrix  {u(A^,  X-^)}.  Such  discrete  formulation  is  useful  for  the  program¬ 
ming  of  a  digital  computer  to  evaluate  all  the  alternatives,  which  in  most 
practical  cases  will  be  the  only  method  available. 


3.  3.  1  The  Bayesian  Effectiveness  (Discrete  Formulation) 


In  subsequent  sections,  the  following  notations  will  be  used: 


u 


IN 


U  = 


u 


LN 


where  u.  .  =  U  (A^,  X^) 
ki 


^IM 


Q  = 


where 


^N1 


q..  -  P(yJIx') 


^NM 
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^IM 


P  = 


P 


N1 


P 


NM 

— 


where  p. .  =  P(X^|Y^) 


p(x^)  •  pcy^lx^) 

“  N 

E  P(X^)  •  P(Yj|x^) 
k=l 


P(X^)  •  P(Yj|x^) 

p(yJ) 


The  j-th  column  of  P,  denoted  by  [P]., 


1 

1 

p(x^|y^) 

^Nj 

II 

p(x^  IyJ) 
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is  the  conditional  probability  distribution  over  the  states  of  nature  if  has 
been  observed.  It  is  then  clear  that  the  j-th  column  of  UP,  denoted  by 

y  is  the  set  of  expected  utilities  associated  with  various  acts  condi¬ 
tional  on  the  occurrence  of  Y^. 

We  shall  now  define  the  operator  Let  [a]  be  a  column  vector. 


M 


Then  [A]"  =  max  {  } 

i 


Let  A  be  a  matrix, 


a 


11 


a 


IN 


■  ■  ■  ^MN 


Then  =  ([A]*  .  .  .  [A]''*  ),  where  [a],  denotes  the  j-th  column  of  A. 

1  N  J 

With  the  aid  of  the  operator  ,  we  can  define  the  Bayesian  decision 
.  .  A  1  4 

rule  as  a  (Y^)  =  A^  such  that  V(A^  |yJ)  =  [UP]*  .  Then  the  Bayesian 

j 

value  of  an  information  system  %  is  given  by 
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A  ^  j  r  . 

V(x)=E  P(Y-^)  [UP]* 

j=l 

where  P(Y-’)  =  S  P(X^)  •  P(  1)^). 
k=l 

A 

Let  E  (x  )  be  the  Bayesian  effectiveness  associated  with  an  information 
system  x  •  Then 


^  (x)  =  v(>t )  -  ^(x°) 


where  denotes  the  null  information  system. 


Consider  the  k-th  component  of  [UP]..  It  is 


N 


k  .  i 


h  - 


z  U(A  ,  X  )  P(X  lY-^)  = 
i=l 


1 


p(yJ) 


S  U(A^,  X^)  P(X^)  P(Y-’IX^). 
i=l 


Let  U  =  UD 


^P(xS  . 


where  D  = 


P(X^) 


is  a  diagonal  matrix  whose 


diagonal  elements  are  P(X^)  .  .  .  P(X^).  Then  the  k-th  component  of  [UP]. 

1  _ _  _  _  _  ^ 

is  simply  p^yJ)  kj-th  element  ofUQ. 


1 


Since  [tTQ].)*  [UQ]|, 


it  follows  that 
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M 

V(X)  =  L  P(Y'^)  [UP]* 

j=l  ■' 

M  .  I  _ 

--  £  P(Y^  tUQ].)* 

M 

=  L  [UQ]* 

j=l 

-  (UQ)*  ? 


where  5  is  a  column  vector  with  M  components  whose  values  are  all  equal 
to  1. 

Let  P  be  the  P  matrix  associated  with  the  null  information  system  X 
o  .  o 

Since  [UP^]^.  is  the  weighted  average- -with  the  weights  {P(X  )3--  of  the 
columns  of  U  and  is  independent  of  j,  we  shall  denote  it  by  [U  j.  Then 


M 


V(Xo)=  L  P(yJ)[Uq]*  =  [Uq]* 

j=l 


and 


E(x)  =  (UQ)*  §  -  [Uo]*  =  (UQ)*  §  -  (U77)'‘ 


where  is  the  vector  whose  i-th  component  is  the  prior  probability  of 
x\  P(X^). 


4.  PARTITIONING  OF  INFORMATION  SYSTEMS 


Suppose  that  each  state  of  nature  is  partitioned  into  two  parts  denoted  by 
X  and  Z,  respectively,  and  each  message  is  partitioned  into  two  parts  denoted 
by  Y  and  W,  respectively,  such  that  the  following  conditions  are  satisfied: 


(1)  P(X,  Z)  =  P(X)  P  (Z) 

(2)  P(Y|x,  Z)  =  P(Y|X,  Z')  for  all  Z  and  Z' 

(3)  P(W|X,  Z)  =  P(W|X',  Z)  for  all  X  and  X' 


This  represents  a  situation  in  which  X  and  Z  are  statistically  independent, 
and  Y  and  W  are  the  outputs  of  channels  that  do  not  cross  talk.  Then, 


p(y|x) 


P(W  |Z) 


P(wlx) 


P(Y|Z) 


s  P(Z)  P  (Y|  X,  Z) 

z 

P(YlX,  Z)  S  P  (Z) 
z 

P  (Y|x,  Z) 

S  P  (X)  P  (W|X,  Z) 
X 

p  (w |x,  z)  E  P  (X) 
X 

p  (Wjx,  Z) 

E  P  (Z)  P  (W|  X,  Z) 

z 

i:  p  (z)  p  (wjz) 
z 


P  (W) 

i:  p  (X)  p  (Yjx,  z) 

X 

i:  p  (X)  p  (Y|x) 

X 

P  (Y) 
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P(W1y)  =  2  P(WlX,  Z)  P(X,  Z|Y) 

X,  z 

=  s  p  (Wjz)  P  (XjY)  P  (ZjY) 

X,  z 

=  s  P  (W|Z)  P  (Z|Y)  L  P  (X|Y) 
Z  X 

=  s  p  (WjZ)  P  (Z|Y) 

z 

=  L  P  (W|z)  P  (  Z) 
z 

=  P  (W) 

forP(ZlY)=  =  P(Z)- 

P(W,  Y)  =  P(Y)  P  (W|Y) 

=  P  ( Y)  P  (W) 


So,  we  see  that  Y  and  W  are  statistically  independent,  and  it  follows 

that 


P(Y,  WjX,  Z)  =  P  (Y|X,  Z)  P  (W|X,  Z) 

=  P  (Y|X)  P  (W|Z)  . 

Therefore,  if  we  order  the  rows  and  columns  of  Q  so  that  states  which 
share  a  common  are  adjacent  to  each  other  and  messages  which  share  a 
common  are  adjacent  to  each  other,  then  Q  can  be  partitioned  as  follows 
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X 


X 


N 


P(yMx^)R 


P(Y^lX^)  R 


where  R  is  the  matrix. 


W 


p(wMz^) 


P(W^lZ^) 


p(y^1xMr 

P(  Y^lX^)R 


p(w'^lz^) 


p(w'^lz^) 


If  we  partition  the  matrix  U  by  ordering  its  columns  so  that  states  with 
a  common  X^  are  adjacent  to  each  other,  i.  e.  , 


1 


N 


X  ...  X 

U  =  [  ...  ] 

where  each  of  the  has  S  columns,  then. 


—  N  i,-_-  N  M.i— • 

UQ  =  [  E  P(yMx^)U^R,  ...,  E  P(Y  1X)U^R] 

i=l  i=l 

and 

_  N  ,  .  _ i  N  Mi' 

(UQ)«  =  [(  E  P(yMx^)  U  R)’!',  .  .  .  ,  (  E  P(Y  jX  )  U^R)*  ] 

i=l  i=l 
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5.  PARTITIONING  AND  SYSTEM  DESIGN 


The  result  obtained  in  the  previous  section, 


(UQ)^ 


r  ^ 

-  U  E 

i=l 


P(Y^  Ix^)  U  ^  R)*, 


N 

E  P(Y 

i=l 


M 


x') 


Rfl, 


can  be  utilized  to  shorten  the  computation  of  (U  and  (U  §  and  thus  of 
the  effectiveness  of  the  system  when  one  of  the  subsystems  is  considered 
fixed. 


In  designing,  one  may  have  sufficient  reason  to  consider  one  subsystem 
essentially  fixed.  For  example,  one  may  have  sufficient  grounds  to  expect 
that  a  given  subsystem  will,  in  all  acceptable  designs,  be  nearly  perfect. 
Then  the  size  of  the  evaluation  effort  would  be  greatly  reduced  if  the  sub¬ 
system  were  modeled  with  an  identity  matrix  with  the  effectiveness  of  the 
over-all  system  becoming  dependent  only  on  the  other  subsystem  design 
parameters.  If,  for  example,  in  the  above  formula  we  set  P{YJ1x^)  =  6  — 

where  6-  •  =  0,  i  ^  j;  6^4  =  1,  i  =  M  =  N,  i.  e.  ,  if  the  {X,  Y)  subsystem  is 
1 J  J 

assumed  to  be  perfect,  then 

(UQ)-  =  [(U^R)*  .  .  .  .  (U^R)*] 


An  alternative  formula  can  be  obtained  by  identifying  the  perfect  subsystem 
with  the  (Z,W)  subsystem,  in  which  case  R  =  I,  and 


(UQ)* 


N  ,  i  _i  N 

[(  E  P(yMx  )  U  E 

i-1  i=l 


P(Y 


M 


x")u")*  ] 


Partitioning  the  system  into  the  two  subsystems  {X,  Y)  with  model  P  and 
(Z,  W)  with  model  R  is  a  convenient  device  to  explore  the  behavior  of  system 
effectiveness  when  only  one  of  the  subsystems  is  varied.  In  particular,  if 
one  is  interested  in  determining  if  one  can  afford  to  ignore  certain  environ¬ 
mental  variables,  one  must  determine  what  happens  to  the  effectiveness  of 
the  over-all  system  as  one  of  the  two  subsystems  is  reduced  to  a  null  system, 
i.  e.  ,  the  over-all  system  is  reduced  to  the  remaining  subsystem. 

Suppose  that  the  system  to  be  eliminated  is  the  {X,  Y)  subsystem.  Then 
for  it  we  have  that. 
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for  all  i’s 


P(Y^lx^)  =  P(yS 

P(Y^lx^)  =  P(Y^)  foralli's 


P(Y^lx^)  =  P(Y^) 


for  all  i*s 


Then, 


(UQ)^:^ 


1  .  TV/f  * 

[(  E  P(Y  )  u"r)*,  .  .  .  ,  (  E  P(Y  )u"r)*] 
i=l  i=l 


1  N 

[P(Y  )  {(  E 
i=l 


U")R  }*, 


M  N 

p(y'^){(  e 

i=l 


U  )  R] 


..  ] 


In  computing  the  effectiveness  of  Q,  we  have  to  compute  (TTO)"!  ,  i.  e.  , 
the  sum  of  the  components  of  the  row  vector  (UQ)'‘S  but  this  sum  is  equal  to 
the  sum  of  the  components  of  the  vector, 

1  N  2  ^  — i  ■ 

P(Y  )  [(  E  U  )  R  ]*  +  P(Y  )  [(  E  U  )  R  ]"'  + . 

i=l  i=l 

\A  ^  N  _•  _  + 

+  P(y'^)[(E  U^)R]*  -  [(E  U^)R]''  =  (U^R)^'‘ 

i=l  i=l 

i.  e.  ,  the  effectiveness  of  subsystem  ( Z,  W)  operating  alone  can  be  computed 
in  the  usual  manner  by  constructing  a  new  U  matrix  by  summing  together  all 
the  columns  of  U  which  correspond  to  a  given  .  This  of  course  is  an 
expected  result. 

If(UQ)|,  =  [(U^R)f  (U^R)*  .  .  .  (U^R)*  ]  and  (UQjg  =  (u’''r)* 

perfect  null 

are  used,  one  can  compute  an  upper  bound  for  the  improvement  of  the  system 
effectiveness  to  be  obtained  by  extending  the  system  to  include  the  (X,  Y)  sub¬ 
system.  Such  an  upper  bound  can  be  expressed  as  a  function  of  various  design 
configurations  R  for  the  (Z,  W)  subsystem  so  that  a  truly  documented  decision 
can  be  made  on  whether  to  increase  the  detail  of  the  world  sensed  by  the 
system. 


-  24 


6.  AN  EXAMPLE  OF  EVALUATION  OF  ALTERNATIVE 
INFORMATION  SYSTEMS 

An  analysis  of  several  alternate  designs  of  a  hypothetical  air  defense 
system  is  given  in  this  section  for  three  main  reasons:  1)  to  gain  an  appre¬ 
ciation  for  potential  usefulness  of  this  approach  on  the  future  selection  and 
design  of  information  systems,  2)  to  clarify  the  theoretical  formulations 
presented  in  the  previous  sections,  and  3)  to  illustrate  the  nature  of  the 
calculations  which  are  necessary  to  carry  out  the  analysis. 

Consider  the  following  hypothetical  air  defense  system  description  as  it 
might  be  summarized  in  the  system  manual. 

’*Enemy  targets  are  detected  by  speed  and  altitude  sensors  which  are 
located  along  the  defense  perimeter.  Target  altitude  and  speed  information 
is  transmitted  from  these  sensors  to  the  headquarters  search  station  where 
the  information  is  displayed.  On  the  basis  of  this  information,  the  target 
characteristics  are  determined  and  the  threat  is  evaluated  by  the  air  threat 
coordinator.  The  availability  of  defense  forces  is  then  considered,  and 
appropriate  weapons  are  assigned  and  committed  by  the  air  defense  com¬ 
mander.  ” 

Further  technical  details  of  interest  are  given  in  later  sections  of  the 
system  manual: 

"The  information  subsystem  may  be  diagrammed  as  shown  in  Figure  A. 


Altitude 

Sensor 


Headquarters 

Display 


Speed 

Sensor 


Figure  A.  The  Information  System 


Altitude  and  speed  information  is  processed  by  an  analog  to  digital  converter 
and  transmitted  to  a  multiplexer  from  which  it  is  sent  via  a  common  channel 
to  the  headquarters  display.  The  altitude  and  speed  sensors  operate  inde¬ 
pendently  of  each  other  so  that  the  errors  of  one  sensor  are  statistically 
independent  of  the  other,  i.  e.  ,  there  is  no  cross  talk  between  them.  " 
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Given  this  brief  description  of  the  air  defense  system,  let  us  determine 
the  value  of  the  information  subsystem,  i.  e.  ,  the  system  diagrammed  in 
Figure  A,  to  the  over-all  air  defense  system. 

To  make  the  illustrative  analysis  feasible  without  the  use  of  a  computer, 
the  number  of  different  states  of  the  world  and  the  number  of  allowable  acts 
open  to  the  air  defense  commander  have  been  reduced  to  a  number  which 
allows  the  computations  to  be  made  by  hand  and  yet  preserves  most  of  the 
reality  of  this  hypothetical  situation. 

States  of  the  World  and  Messages 

Enemy  missiles,  bombers,  and  light  planes  are  the  kind  of  targets  con¬ 
fronting  this  system.  Defensive  fighters  and  missiles  will  be  available  to  the 
air  defense  commander.  Let  it  be  assumed  that  it  is  sufficient  for  purposes 
of  classifying  enemy  targets  that  the  altitude  sensor  be  capable  of  discrimina¬ 
ting  three  levels:  low  altitude,  medium  altitude,  and  high  altitude.  It  is  also 
assumed  that  it  is  sufficient  for  the  speed  sensor  to  distinguish  two  speeds: 
low  speed  and  high  speed.  The  state  of  the  world  may  then  be  represented 
by  a  Boolean  vector  (a  vector  whose  entries  are  either  one  or  zero)  with  three 
elements,  or  bits,  which  are  assigned  as  follows: 


(Xj,  X^.  X3)  =  {X) 


altitude  speed 
bits  bit 


These  state  vectors  will  be  identified  by  their  binary  number  and  are  inter¬ 
preted  as  follows: 


State 

Vector 

Number 


State 

Vector 

Elements 


INTERPRETATION 


{XpX2,  X3} 


Altitude 


Speed 


Threat 


0 

1 

2 

3 

4 

5 

6 

7 


0,  0,  0 

0,  0,  1 

0,  1,  0 

0,  1,  1 

1,  0,  0 

1,  0,  1 

1,  1,  0 

1,  1,  1 


No  Reading 
No  Reading 
Low 
Low 

Medium 

Medium 


Slow 

Fast 

Slow 

Fast 

Slow 

Fast 


Slow  Speed 
High 


No  Threat 
No  Threat 
Bomber 


Light  Plane 
Bomber 


High 

High 


Light  Plane 
No  Threat 
Missile 
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The  messages  about  the  state  of  the  world  generated  by  the  sensors  and 
transmitted  to  headquarters  are  also  represented  by  Boolean  vectors: 

(Yi,  Y^.  Y^}  =  {YJ} 

in  which  the  i-th  element  of  the  j-th  message  vector,  y?  ,  is  the  message  bit 
for  the  corresponding  variable  of  the  state  vector. 


The  Action  Set,  Resources,  and  Utilities 


It  will  be  assumed  that  the  air  defense  commander  will  always  have  at 
least  one  fighter  and  one  missile  at  his  disposal,  and  for  every  enemy  contact 
he  is  allowed  to  take  one  of  the  following  three  actions,  denoted  by  A.: 


Aq  -  Do  nothing 

A^  -  Engage  the  enemy  with  a  fighter 


A^  -  Engage  the  enemy  with  a  missile 

The  utility  associated  with  each  act  and  state  of  the  world  pair,  u(A.,  X*^), 
must  be  defined.  The  utilities  can  be  calculated  after  the  weapon  system 
parameters  and  the  cost  and  threat  values  of  all  weapons  are  specified. 


Eet  us  assume  the  following  values  for  the  system  and  target  parameters: 


Cost  of  one  fighter  mission 

'"MF 

- 

.  1 

Cost  of  one  defensive  missile 

'"m 

1.  0 

Threat  of  enemy  bomber  that 
penetrates  the  defense 

= 

-20.  0 

Threat  of  enemy  missile  that 
penetrates  the  defense 

= 

-20.  0 

Value  of  destroying  enemy  bomber 

5.  0 

Value  of  destroying  enemy  missile 

= 

3.  0 

Value  of  destroying  enemy  light  plane 

''f 

= 

2.  0 
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Probability  of  kill  for  fighter  against 
bomber  engagement  (any  altitude) 

^FB 

.  75 

Probability  of  kill  of  fighter  against 
light  plane  engagement  (any  altitude) 

'"ff  = 

.  5 

Probability  of  kill  for  missile-bomber 
engagement  for  medium  altitude 
bombers 

^MB 

.  7 

Probability  of  kill  of  missile-light 
plane  engagement  for  medium 
altitude  light  planes 

^MF 

.  7 

Probability  of  kill  for  missile-missile 

P 

MM 

.  95 

engagement 


The  general  terms  of  the  utility  matrix,  u(A.,  X'^),  were  derived  using 
the  above  notation  and  are  given  as  follows: 


U  = 


X 

0 


-c 


MF 


-C 


M 


-C 


X 

0 

MF 


X 


X' 


-C 


M 


X'* 

Tb 


0 

P  V  -C 

FB  B  MF 

^FF^F 

'  FB  B 

“^MF 

T  -  C 

B  M 

u 

1 

X^ 

X6 

0 

0 

X' 


M 


^fb^b'^mf 
<'-^fb)  ^b 

^FF^F 

“^MF 

“^MF 

P  V  -c 

MB  B  M 

{ 1  -  P  )  T 

'  mb'  B 

^MF^F 

■^M 

1 

O 

(l-P  )T 

^  mm'  Is 

derive  Uj^  which 

is  the  u 

tility  of  Ap 

engaging  the 

with  a  fighter,  where  the  fourth  state  of  the  world  is  present,  i.  e. 


x^  = 


1 

0 

LOJ 
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which  is  defined  to  be  a  bomber  at  medium  altitude.  The  u,  .  is  the  sum  of 

1,4 

the  following  terms: 

The  expected  value  of  destroying  the  bomber  with  a  fighter, 


P 


FB 


V 


B 


The  expected  threat  value  of  the  bomber  if  the  bomber 
survives  the  fighter  mission, 


B 


The  cost  of  the  fighter  mission 

■^MF 


Therefore,  4  =  ^fB^B  ^  ^^"^FB^'^B  "  ^MF'  entry  in  the 

second  row  and  fifth  column  of  uy 

When  the  values  assumed  for  these  terms  are  substituted,  we  then  have 
the  utility  matrix  used  throughout  this  analysis: 


0 

0 

-20. 00 

0 

-20.  00 

0 

0 

-20.  0 

U={u..)  = 

-  .  1 

-  .  1 

-  1. 35 

.  9 

-  1.  35 

.9 

-  .  1 

-20.  1 

-1.0 

-1.0 

-21. 00 

-1.0 

-  3.  5 

.  4 

-1.0 

1. 0 

Every 

one  of  the  eight 

possible 

states  of  nature, 

x°,  x\ 

. . .  x^. 

will  be 

assumed  to  be  equally  likely.  Therefore, 


n  - 


8 

_1 

8 


1_ 

8 
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and  therefore, 


—  1 

0 

0 

-20.  00 

0 

-20.  0 

0 

0 

-20.  0 

-  .  1 

-  .  1 

-  1.  35 

.  9 

-1.  35 

.  9 

-  .  - 

-20.  1 

-1.0 

-1.0 

-21.  00 

-  1 

-3.  5 

.  4 

-1.0 

1. 0 

0 


is  given  by, 


A 

U  = 


-7.  50 
-2.  68 
-3.  26 


I A 
U 


-2.  68. 


and 


The  Determination  of  the  System  Model 


The  system  Q,  where  Q  =  {p(Y‘^lx^},  is  the  only  variable  entity  which 
remains  to  be  determined  before  the  system  effectiveness,  E(Q),  may  be 
calculated  by 


... 

A 

E(Q)  = 

U  Q 

§  - 

U 

The  matrix  Q  may  be  thought  of  as  a  probabilistic  model  of  the  informa¬ 
tion  system.  It  provides  a  measure  of  the  fidelity  with  which  signals  or 
messages  are  transmitted;  it  reflects  the  degree  to  which  a  signal  is  degraded 
by  noise.  Individual  components  of  an  information  system  are  influenced  by 
noise  in  different  ways  as  a  result  of  inherent  differences  in  their  functions, 
design,  etc.  One  must  therefore  take  the  Q  of  each  individual  subsystem  of 
the  information  network  and  properly  combine  it  with  the  others  to  get  an 
over-all  system  Q.  The  following  illustrates  how  the  system  model  was 
obtained  for  one  of  the  system  configurations  used  in  this  evaluation.  The 
system  models  for  the  other  configurations  are  obtained  in  a  similar  man¬ 
ner.  The  system  configuration  considered  is  given  in  Figure  B.  In  this 
case,  the  system  Q  was  built  up  by  combining  the  altitude  sensor  subsystem, 

Q  ,  the  speed  sensor  subsystem,  Q  ,  and  the  main  communication  link  to 
a  s 

headquarters,  Q^.  Obviously,  one  could  carry  the  analysis  further  by  taking 
into  consideration  the  Q  of  the  sensor,  of  the  analog  to  digital  converter,  of 
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the  link  to  the  multiplexer,  etc.  The  methods  for  doing  this  would  remain 
the  same,  but,  of  course,  the  computational  load  would  increase.  The  detail 
to  which  an  analysis  is  carried  would  depend  on  its  purpose. 


Altitude 

Sensor 


Speed 

Sensor 


Binary 

Symmetric 

Channel 


Figure  3.  Subsystem  Models  and  Their  Relations 


Altitude  and  Speed  Subsystem 

The  altitude  classification  and  altitude  to  signal  conversion  may  be 
shown  as  follows,  where  p  and  X  are  specific  altitudes  with  p  <  X  • 


Target  Altitude 


No  Target 

0  -  p 

P  -  X 
Above  X 


State  of  the  World 


No  Target 
Low  Altitude 
Medium  Altitude 
High  Altitude 


State  of  the  World 
Vector  Representation 


0 

0 


0 

1 

0 

1 


Let  us  assume  that  the  only  kind  of  possible  error  in  this  subsystem  is 
of  the  type  where  the  analog  to  digital  conversion  may  err  by  only  one 
neighboring  unit.  Let  P^  be  the  probability  of  this  error.  Then  the  possible 
signals  and  their  associated  probability  of  occurrence  resulting  from  targets 
at  the  various  altitudes  will  be  as  shown  in  Table  1. 
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Table  1 


State  of  the  World 
Vector  Representation 


Possible 

Message 


Target  Altitude  X2 


No  Target 


0 


Low  Altitude  0 


Medium  Altitude  1 


High  Altitude  1 


Probability 
of  Message 


1  -  P, 


p  /2 
a 

1  -  p^ 

c 

P  /2 
a 


P  /2 
a 

1  -  p, 

c 

p  /2 
a 


P 


a 


1 


“  P 

a 


Using  the  values  from  Table  1,  it  is  seen  that  the  altitude  subsystem 
Q  matrix  is  given  by: 


=  {p{y^lx')}  = 

ij 


1  -  P 


P  /2 
a 


0 

0 


a 

1  -  P 


P  /2 
a 


P  /2 
a 

1  -  p 


0 

0 


P  /2 
a 

1  -  P„ 


The  above  matrix  is  typical  of  the  noise  process  associated  with  the  mea¬ 
surement  and  quantization  of  a  continuous  variable. 
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Let  the  speed  subsystem  error  rate  be  denoted  by  p  . 

s 

Q  matrix  is  obviously  given  by 


Then  the  resulting 


=  [p  (Y^lx^)}  = 

ij 


1  -  P. 


1  -  p 


The  Channel  Subsystem 

The  three  bits  that  make  up  the  altitude  and  speed  information  are  trans¬ 
mitted  serially  on  a  binary  channel  which  is  assumed  to  have  an  error  proba- 


bility  r.  The  error  probability  : 

is  usually  small, 

so 

that  powers  higher  than 

the  first  have  been  neglected  in 

calculating 

elements 

of  the  Q 

!  matrix.  The 

Q  matrix  is  therefore  given  by: 

l-3r 

r 

r 

0 

r 

0 

0 

0 

r  1 

-3r 

0 

r 

0 

r 

0 

0 

r 

0 

l-3r 

r 

0 

0 

r 

0 

-  {p  (Y^lx")}  =  ° 

r 

r 

l-3r 

0 

0 

0 

r 

ij  r 

0 

0 

0 

1-3 

r  r 

r 

0 

0 

r 

0 

0 

r 

l-3r 

0 

r 

0 

0 

r 

0 

r 

0 

l-3r 

r 

0 

0 

0 

r 

0 

r 

r 

l-3r 

Typical  elements  of  are 

calculated 

as  follows 

: 

p  ( Y^jX^)  =  p  (010  1  010)  = 

(1- 

r)(l-r)(l-r)  = 

(1- 

r)^ 

1  - 

3r  when  negl 

ecting  high 

order 

terms 

p  (  =  p(l00|010)  =  r  r(l-r)  =  r^  -  r^  =  0 
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System  Q 


The  procedure  for  finding  the  system  Q  applies  the  same  kind  of  approach 
used  in  finding  an  over-all  equivalent  impedance  of  an  electrical  network.  In 
an  electric  circuit  one  does  this  by  combining  individual  impedances  in  various 
series  and  parallel  combinations  according  to  the  rules  of  physics  and  vector 
calculus.  In  our  case  the  rules  for  finding  system  Q  follow  from  the  laws  of 
probability  and  matrix  algebra.  We  shall  achieve  the  combination  in  the  follow¬ 
ing  order: 

1)  Q  and  Q  are  combined  to  form  an  equivalent  Q  ,  i.  e.  , 

s  a  sensor 

Q 


Q 

s 

2)  Q  is  combined  with  Q  to  form  the  equivalent  system  Q,  i.  e.  , 

sensor  c 


Q  -  Q  -  =  -  Q  - 

sensor  c  system 


In  our  system,  Q  is  an  8x8  matrix  which  shows  the  effect  of  noise 

sensor 

introduced  in  the  sensor  subsystems.  The  individual  elements  of  Q 

sensor 

are  joint  probabilities,  and  because  of  the  assumed  statistical  independence 
of  the  operation  of  the  altitude  and  speed  subsystems,  they  are  calculated  as 
follows: 


Q  =  (p  (Y^“lx“"P))  =  p  (Y^''|X"")P  (yMxP) 

sensor  sensor  a  s 

jkl,  mnp 

For  example, 

P 

sensor 


000,  000  00,  00  0,  0 
(Y  X  )  -  p  (Y  X  )  p  (Y  X  )  =  (1-p  )(l-p  ) 
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or, 


P  (Y 

sensor 


001 


|x“»)  = 


Pa*"" 


00 


x“») 


Ps'’' 


‘lx" 


)  =  (1-D  ) 
a 


or, 


p  (y““|x“*)p  (Y‘|x“)  =  i:i  p 

sensor  a  '  s  ^  s 


etc. 


The  information  system  is  now  reduced  to  a  pair  of  Q's  in  series  which 
may  be  combined  by  matrix  multiplication.  That  is, 


{p(Y^lx')}  =  Q  = 


Q  Q 

sensor  c 


The  reason  for  this  operation  becomes  clear  when  one  examines  the  terms 
of  any  element  of  Q.  Take  the  term  Q(2,  2),  for  example,  which  is 

p(  Y®  |X® ^  ®).  According  to  the  matrix  multiplication  indicated  above, 
this  term  is  equal  to 


,  010,  010, 

p(Y  |X  ) 


7 

E 

j=0 


p  (zjlx°^°)p  (Y°^°|zj) 

sensor  '  c 


where  is  the  j-th  output  message  of  the  sensor  subsystem  which  acts  as 
the  j-th  input  message  for  the  communication  subsystem.  The  above  is  the 
correct  expression  for  the  probability  of  the  message  given  that  the 


state  of  the  world  is  X 


010 


This  is  exactly  what  this  element  of  the  Q 


matrix  is  supposed  to  represent. 


-  35  - 


6.  1  Alternative  Configurations 


In  the  following  pages  five  sets  of  systems  assumptions  are  evaluated. 
The  assumptions  were  selected  mainly  with  the  purpose  of  making  a  number 
of  points  which  indicate  what  can  be  obtained  from  evaluations  of  this  type. 

It  is  important  to  point  out  that  the  essential  characteristic  of  the  design 
process  is  the  judicious  choice  of  a  sequence  of  design  assumptions.  The 
method  does  not  entirely  determine  such  choices  but  guides  them  by  show¬ 
ing  the  relative  merits  of  alternate  choices.  It  is  not  easy,  of  course,  to 
render  the  flavor  of  a  true  design  effort  by  using  a  few  sets  of  assumptions, 
but  the  points  made  in  the  discussion  should  tend  to  do  so  in  a  partial  manner. 

6.  2  The  Determination  of  Channel  Requirements  for  Various  Sensor  Inputs 

Let  us  assume  that  one  has  n  variables  being  sensed  by  the  sensors 
Sp  •  •  •  Sj^.  An  important  question  is  what  should  be  the  amount  of  noise 
to  be  tolerated  for  each  signal  to  obtain  maximum  effectiveness  for  a  fixed 
cost  (or  maybe  maximum  effectiveness  per  dollar).  If,  in  particular,  one 
assumes  that  the  noise  is  due  mainly  to  the  channel  on  which  the  sensor 
signals  are  to  be  sent,  such  a  question  can  be  answered  by  assuming  a 
system  configuration  of  the  following  type:* 

System  Assumption  #  1 


Altitude 

Altitude  Sensor 

Sensor 

© 

Channel 

Speed 

Speed  Sensor 

Sensor 

Channel 

Headquarters 

Display 


Altitude  Sensor 

Speed  Sensor 

Altitude  sensor  channel  - 


perfect,  no  noise  (Q  -  I) 
perfect,  no  noise  -  I) 

b  A 

binary,  symmetric,  noisy  with 

error  rate  r  . 

a 


A  similar  analysis  is  possible  if  the  sensors  are  also  assumed  to  be 
noisy. 
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Speed  sensor  channel 


binary,  symmetric,  noisy  with 

error  rate  r  . 

s 

Double  and  triple  errors  are  neglected  in  computing  Qq-d’ 

The  Q  of  the  system  will  be  function  of  the  parameters  r^  and  r^.  Thus 
the  system  effectiveness  is  plotted  on  the  Tq  plane  in  Figures  1  and  2.  In 
Figure  1  the  regions  where  distinct  decision  rules  prevail  are  indicated;  in 
Figure  2  the  lines  of  equal  effectiveness  or  isoquants  are  given.  Also  in 
Figure  2  an  iso-cost  line  for  the  cost  assumption  rg  =  K  =  .  01  is  indicated. 
It  should  be  pointed  out  that  the  (.  5,  .  5)  point  corresponds  to  the  null  system, 
i.  e.  ,  the  system  which  provides  headquarters  with  no  information  whatever 
about  the  environment.  The  point  (0,  0)  corresponds  to  a  system  which  makes 
no  errors  providing  headquarters  with  perfect  knowledge  of  the  environment. 
The  system  designer  could  optimize  the  design  by  seeing  where  along  this 
iso-cost  curve  system  effectiveness  is  maximized.  In  System  1  effectiveness 
is  maximized  at  r^  =  .  5  and  r^^  =  .  02.  This  is  an  extreme  case  and  suggests 
that  at  this  particular  level  of  funding  the  speed  sensor  is  not  necessary.  The 
extreme  location  of  this  solution  is  a  result  of  the  approximations  made  in 
finding  Qq  =  Qgp  •  Actually,  if  the  higher  order  terms  had  not  been 

neglected,  the  effectiveness  would  have  decreased  more  rapidly  than  is 
indicated  in  Figure  2,  and  the  optimum  point  would  probably  have  been 
located  near  rg  =  .  35,  r^^  -  .  03. 

Demonstrated  here  is  the  fact  that  the  most  effective  fixed  cost  system 
is  not  one  with  equal  error  rates  for  the  two  signals,  but  one  in  which  the 
error  rates  are  very  uneven.  Without  an  analysis  utilizing  an  underlying 
structure  of  the  entire  system,  one  would  have  no  rationale  to  make  such 
an  assertion.  Moreover,  one  would  probably  have  adopted  an  equal  accuracy 
design,  possibly  based  on  such  an  irrelevant  criterion  as  the  minimum  sum 
of  errors  function  for  a  given  allocation  of  funds. 

In  general,  one  attempts  to  use  a  single  communication  channel  by 
feeding  the  output  of  various  sensors  into  a  time  or  frequency  multiplexer. 
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Thus,  one  might  object  that  the  system  assumption  given  above  is  not  a  useful 
one,  since  it  presupposes  separate  channels  for  the  altitude  and  speed  signals. 
The  point  to  be  made  is  that  such  an  assumption  should  be  considered  as  a 
prerequisite  for  assessing  if  multiplexing  can  be  performed  without  any 
encoding  of  the  various  signals.  What  we  have  in  mind  is  that  if  the  optimal 
choices  for  the  error  probabilities  of  the  various  individual  channels  turn  out 
to  be  nearly  the  same,  one  can  multiplex  the  signals  and  feed  them  to  a  com¬ 
mon  channel  without  further  ado.  The  capacity  of  such  channels  would  be 
determined  2  by  the  bit  rate  and  the  said  (common)  error  probability.  If, 
on  the  other  hand,  the  various  recommendations  of  the  error  rates  are  highly 
dissimilar,  one  has  to  encode  the  signals  for  which  high  accuracy  is  required 
so  that  their  encoded  form  can  be  transmitted  at  the  noise  levels  which  are 
permissible  for  the  lowest  accuracy  signal.  The  common  channel  can  then 
be  designed  for  the  lowest  accuracy,  thereby  avoiding  the  waste  of  channel 
capacity.  Thus,  in  the  case  of  this  example,  the  system  to  be  realized 
would  take  the  form  illustrated  in  Figure  D. 


System  Assumption  #2 


This  is  identical  to  the  above  system  assumption  except  that  the  alti¬ 
tude  signal  is  assumed  to  be  coded  with  a  single  error  correcting  code;  in 
other  words,  the  part  of  the  system  model 


Q, 


SP 


is  given  by: 
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yJ 


00 

01 

10 

11 

00 

0 

0 

Ra 

01 

0 

1-R 

R 

a  a 

0 

10 

0 

R 

a 

1-R 

a 

0 

11 

R 

a 

0 

0 

instead  of: 

yj 

00 

01 

10 

11 

00 

(l-rj2 

r  (1-r  ) 

^  a 

2 

^a 

01 

(1-r 
'  a' 

2 

^a 

10 

r  (  1  -r  ) 
a'  a 

2 

r 

a 

(l-rj2 

'al'-'-a* 

11 

2 

r 

2 

(l-Ta) 

R  is  the  probability  of  a  double  error,  and  r  is  the  probability  of  a  single 


A  comparison  of  the  isoquants  for  Systems  1  and  2,  Figures  2  and  4, 
brings  out  another  interesting  effect,  namely,  the  unexpected  manner  in 
which  coding  can  change  the  dependence  of  system  effectiveness  on  the 
error  rates  of  the  channels.  The  comparison  of  these  two  systems  shows 
that  the  criticality  of  the  speed  information  is  decreased  if  the  information 
in  the  altitude  information  is  single  error  corrected.  Again,  this  sort  of 
effect  can  be  determined  only  if  the  interaction  of  the  various  design  param¬ 
eters  is  considered  through  a  utility  structure. 
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6.  3  Sensor  versus  Channel  Accuracies 


The  next  set  of  three  systems  assumptions  is  introduced  for  two  rea¬ 
sons:  The  first  is  to  introduce  systems  with  both  parallel  and  serial  con¬ 
nections  of  subsystems.  The  second  is  to  show  how  the  over-all  system 
model  permits  one  to  balance  out  the  design  of  all  the  components  of  the 
system.  That  is,  one  can  find  an  optimal  set  of  accuracy  specifications 
for  the  sensors,  channels,  buffers,  etc.  ,  of  the  system  so  that  such  speci¬ 
fication  is  internally  consistent.  The  specifications  of  these  systems  are: 

System  3.  This  is  the  general  configuration  for  which  the  detailed 
computations  were  presented  in  the  previous  few  pages.  A  diagram  of  this 
system  is  given  here  again. 


The  system  assumptions  are: 


Altitude  sensor 


noisy  with  error  rate  to  neighboring  unit 


P 


a 


Speed  sensor 
Sensor  channels 


noisy  with  error  rate  p 
perfect,  no  noise,  r  =  0  ( Qg^ 


I) 


Main  channel  -  binary,  symmetric,  noisy  with  error  rate 

r  =  .  1.  High  order  terms  are  neglected  in 
calculating  Q«- 


System  4.  The  same  system  as  System  3  except  for  the  main  channel 
The  main  channel  is  noisy  with  an  error  rate  r  =  .  1.  Single  errors  are 
eliminated  by  the  adoption  of  a  single  error  correcting  code. 

System  5.  The  same  system  as  System  3  except  for  the  main  channel 
The  main  channel  is  assumed  to  be  perfect,  r  =  0  (Q  =  I). 
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The  utility  matrix,  the  state  distribution,  and  the  allowable  acts  remain 
the  same  for  Systems  1  through  5.  The  detailed  computations  for  Systems  1 
through  4  are  not  presented  since  the  mathematics  is  essentially  the  same  as 
for  System  5. 

Comparing  Systems  3,  4,  and  5,  Figures  6,  8,  and  10,  it  is  immediately 
apparent  that  System  5  is  uniformly  better  than  System  4,  which  in  turn  is 
uniformly  better  than  System  3.  This  is  as  it  should  be,  since  going  from 
System  5  to  System  3  the  communication  channel  becomes  increasingly  more 
noisy  while  the  sensing  subsystem  is  the  same  for  all  three.  Comparing 
Figures  6,  8,  and  10  also  shows  that  most  of  the  effectiveness  loss  due  to 
degrading  the  channels  from  a  perfect  system  (System  5)  to  a  system  with  a 
10%  error  rate  (System  3)  is  recouped  by  the  introduction  of  a  single  error 
correcting  channel  (System  4).  In  other  words,  in  this  case  the  adoption  of 
a  single  error  correcting  code  for  the  main  communication  link  to  headquarters 
results  in  essentially  neutralizing  the  effects  of  a  10%  error  rate  in  this  link. 

It  should  be  pointed  out  that  a  10%  error  rate  is  an  extremely  high  one  for 
communication  channels  and  would  actually  result  from  man-made  interfer¬ 
ence  and  jamming.  Here  we  see  in  a  simple  way  how  this  type  of  evaluation 
can  furnish  quantitative  arguments  for  the  adoption  of  a  coding  scheme  for  a 
channel  which  one  can  expect  to  be  subjected  to  man-made  interference. 

The  sensors  noise  for  Systems  3,  4,  and  5  have  been  characterized  by 
the  matrices: 
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for  the  altitude  sensor,  and  by 
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s 

0  1 
Ps 

Ps  ^-Pb 

for  the  speed  sensor.  Sensing  errors  can  occur  only  as  jumpts  to  an  adjacent 
value  of  the  variable  being  measured.  This  type  of  error  process  is  charac¬ 
teristic  of  the  measurement  of  an  analog  quantity.  It  would  also  be  charac¬ 
teristic  of  a  communication  channel  utilizing  non- digitalized  signals  as,  for 
example,  a  radio  link  utilizing  for  amplitude  modulation  envelope  the  value 
of  an  analog  quantity.  On  the  other  hand,  if  a  digitalized  representation  is 
used  (a  representation  in  which  the  position  in  the  code  block  is  significant), 
the  channel  noise  process  is  quite  different,  since  errors  can  occur  between 
non-adjacent  values.  Because  of  these  characteristics  of  digitalized  trans¬ 
mission,  errors  in  a  digital  channel  can  be  much  more  harmful  than  those 
in  an  analog  channel  or  than  those  in  the  sensors  which  are  usually  of  analog 
type.  The  simple  examples  we  have  computed  demonstrate  this  in  a  quan¬ 
titative  fashion,  at  least  with  regard  to  the  relative  importance  between  the 
sensor  noise  and  the  digital  channel  noise. 

We  can  schematically  represent  Systems  3,  4,  and  5  as  follows: 


The  point  of  coordinates  (0,  0)  in  Figure  10  gives  the  effectiveness  of  the 
system  which  has  noise-free  sensors  and  channels.  This  effectiveness  is 
E(Q)  =  2.  65  and  is  the  least  upper  bound  for  all  systems  with  the  above 
structure.  Let  us  examine  what  happens  to  the  effectiveness  of  the  system 
as  we  degrade  its  three  components  from  the  0%  to  the  10%  error  rate  level. 

a)  Degrade  the  speed  sensor  (0%  _ ^  10%)  but  leave  the 

altitude  sensor  and  the  channel  at  the  no-error  level. 

E(Q)  is  read  at  the  point  of  coordinates  Pg  =  .  1  =  0 

of  Figure  10;  it  is  seen  to  be  E(Q)  =  2.  19-  The  perform¬ 
ance  of  the  system  has  lost  about  6%  of  perfect  perform¬ 
ance. 
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b)  Degrade  the  altitude  sensor  by  10%  leaving  the  speed  sensor 
and  channel  perfect.  This  corresponds  to  the  point  =  ,  1 
Pg  =  0  of  Figure  10.  Then  E(Q)  =  2.  54  with  a  loss  of  about 
4%  of  perfect  performance. 

c)  Degrade  both  altitude  and  speed  sensor  by  10%  leaving  the 

channel  perfect,  ^  ^  Ps  “  •  ^  Figure  10,  E(Q)  =  2.  36 

or  about  11%  of  perfect  performance  is  lost. 

d)  Degrade  the  channel  by  10%  but  leave  the  two  sensors  per¬ 
fect.  This  corresponds  to  the  point  P^  ^  0  =  0  of  Figure 

6  (which  corresponds  to  the  system  with  the  channel  with  an 
error  rate  of  10%).  E(  Q)  =  1,97.  This  is  a  loss  of  better 
than  25%  of  the  perfect  performance. 


Thus,  we  see  that  a  10%  degradation  of  the  communication  channel 
is  much  more  harmful  than  a  similar  degradation  for  all  the  sensors. 

The  above  comparisons  indicate  then  that  an  optimal  design  may 
require  a  greater  accuracy  for  the  channel  than  for  the  sensors  and  not 
an  equal  one.  Furthermore,  one  has  obtained  a  quantitative  measure 
concerning  such  relative  accuracies.  These  are  not  simple  orderings 
derived  through  intuition. 

Observing  the  manner  in  which  the  shape  of  the  isoquants  change 
in  going  from  System  3  to  4  to  5,  one  sees  the  strong  influence  which 
coding  of  the  signals  in  the  main  channel  has  on  changing  the  relative 
importance  of  the  speed  and  altitude  sensor  errors  on  system  effective¬ 
ness.  It  is  seen  that  the  criticality  of  the  speed  sensor  is  progressively 
reduced  in  advancing  to  the  system  with  the  perfect  channel.  This  indi¬ 
cates  that  if  the  channel  is  jammed,  one  has  to  rely  more  heavily  on  the 
speed  of  the  threat  information  to  discriminate  missiles  from  planes. 
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6.  4  Decision  Rules 


The  set  of  figures  1,  3,  5,  7,  and  9  indicate  the  way  the  indifference 
lines  partition  the  system  space.  An  indifference  line  or  contour  is  a  locus 
of  those  system  designs  which  admit  at  least  two  optimal  decision  rules.  A 

decision  rule  is  a  set  of  prescriptions  of  the  type  YJ  _ ^  Aj^  which  can  be 

read:  if  is  observed,  select  the  action  Aj^.  The  indifference  lines  are 
then  the  locus  of  points  which  divide  the  system  designs  into  those  which 
admit  as  the  optimal  rule  some  Aj^  =  d2(YJ)  and  those  which  admit  some 
Ak  =  d2(Y‘^).  The  two  rules  dj  and  d2  differ  in  at  least  one  prescription 
YJ 


^  Aj^.  If  dj  and  differ  in  the  prescriptioh  yJ - ^  Ak>  the  indif¬ 


ference  line  is  marked  with  the  said  yJ  and  also  with  a  marking  which 
shows  which  Ak  is  to  be  selected  on  each  side  of  the  indifference  line. 

As  an  example,  consider  the  region  near  the  origin  of  Figure  9.  This  is 
the  region  in  which  both  the  sensors  and  channels  are  liearly  perfect.  The 


rule  is 

seen  to  be: 

y“  . 

Y^ 

- >  A 

y'  ■ 

Y^ 

— >  A 

. 

Y^ 

— >  A 

Y^  . 

y’ 

- >  A 

Because  the  systems  with  design  characteristics  within  that  region  are 
systems  that  have  infrequent  errors,  we  can  say  that  if  we  receive  a 
message  indicating  that  there  is  a  bomber,  we  can  be  almost  totally  sure 
that  in  actuality  we  are  facing  a  bomber.  Thus,  the  decision  rule  is: 

Y^  E  No  Threat  - ^  Do  Nothing  e  A^ 

Y  =  No  Threat  - ^  Do  Nothing  e.  Aq 

2 

Y  =  Low  Altitude  Bomber - ^  Engage  with  Fighter  =  A^ 

Y  =  Low  Altitude  Light  Plane - ^  Engage  with  Fighter 

Y  =  High  Altitude  Bomber - ^  Engage  with  Fighter  =.  A^ 

Y^  =  High  Altitude  Light  Plane - ^  Engage  with  Fighter  =  A 

-6  ^  ^ 

Y  E.  No  Threat  - ^  Do  Nothing  =  Aq 

7 

Y  Missile - ^  Engage  with  Missile  =  A^ 
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which  of  course  is  the  decision  rule  that  intuition  suggests  to  be  best.  If, 
on  the  other  hand,  we  are  receiving  information  from  the  system  corre¬ 
sponding  to  the  p  =  .  1  p  =  .  1  point  of  Figure  5,  i.  e.  ,  if  because  of 
a  s 

enemy  action  our  sensors  and  channels  both  have  an  error  rate  of  10%, 
due  to  jamming  or  actual  physical  damage,  we  can  trust  our  system  much 
less,  and  we  escalate  our  response.  In  fact,  the  optimal  decision  rule  is: 


**No  Threat” ^  =  Engage  with  Fighter 

”No  Threat” ^  e  Engage  with  Fighter 

"Low  Altitude  Bomber” - E  Engage  with  Fighter 

"Low  Altitude  Light  Plane” - ^  A^  =  Engage  with  Fighter 

"High  Altitude  Bomber" - ^  A^  E  Engage  with  Fighter 

"High  Altitude  Light  Plane” - ^  A^  H.  Engage  with  Missile 

"No  Threat”  - ^  A^  H  Engage  with  Missile 

"Missile"  - ^  A^  =  Engage  with  Missile 


The  various  interpretations  of  the  signals  are  subject  to  doubt  due  to  rela¬ 
tively  high  noise  content  in  the  system.  This  is  why  is  indicated  to  have 
a  "No  Threat"  interpretation  where  the  quotation  marks  emphasize  that  Y® 
may  correspond  to  other  things.  For  example,  Y®  =  000  may  have  originated 
from  =  010  which  is  a  bomber.  This  is  why  a  supposedly  no  threat  condi¬ 
tion  evokes  a  fighter  mission  response  as  insurance  against  the  errors  of 
the  information  system.  The  cases  of  Y^  and  Y^  are  particularly  interesting 
for  Y^  =101  E  light  plane.  An  error  in  the  second  altitude  digit  (due  to 
either  the  channel  or  the  sensor)  may  have  caused  Y^  =  101  to  actually  origi¬ 
nate  from  x7  =111,  that  is,  a  missile.  This  is  the  main  reason  for  re¬ 
sponding  then  with  an  anti-missile-missile.  For  Y^  =  110,  which  is  a  no 
threat  condition,  an  error  in  the  speed  bit  could  cause  a  missile  not  to  be 
detected.  This  is  why  an  anti-missile-missile  response  is  advisable.  By 
comparing  the  two  decision  rules  so  obtained,  one  can  clearly  see  that  a 
10%  degradation  of  the  three  components  of  the  system  causes  a  major 
escalation  of  the  responses  specified  by  the  optimal  decision  rules. 
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SYSTEM  1 


DECISION  RULES  d(rg,  r^) 


Constant  Decision 
Rules 


The  lines  shown  indicate  the  boundaries 
which  divide  regions  requiring  a  different 
rule  of  response.  Each  line  is  labeled 
with  a)  the  message  requiring  the  differ¬ 
ential  response,  b)  the  optimal  resporse 
on  both  sides  of  the  line. 

Figure  1.  Decision  Rules. 


Aq  :  Do  Nothing 
Ai  :  Engage  with  fighter 
A2  :  Engage  with  missile 
yj  :  Defined  on  page  27 
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Altitude  Channel  P  Error  Rate 


SYSTEM  1 


PERFECT  SENSORS 
Sensor  Channels 
Symmetric,  Binary 
Noisy 


r 

s 


Speed  Channel  Error  Rate 


ISOQUANT 


ISO  COST 

r  r  =  . 01 
a  s 


Figure  2.  System  Effectiveness,  E  (Q) 
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SYSTEM  2 


DECISION  RULES 

d(rg  %) 


Constant  Decision 
Rules 

Message  Action 


r 


s 


The  lines  shown  indicate  the  boundaries 
which  divide  regions  requiring  a  different 
rule  of  response.  Each  line  is  labeled 
with  a)  the  message  requiring  the  differ¬ 
ential  response,  b)  the  optimal  response 
on  both  sides  of  the  line^ 


NOTE 

A  :  Do  Nothing 

A^:  Engage  with  fighter 

A^*  Engage  with  missile 

Y*^  :  Defined  on  Page  27 


Figure  3.  Decision  Rules 
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SYSTEM  2 


s 

Speed  Channel  Error  Rate 


ISOQUANT 


Figure  4.  System  Effectiveness,  E(Q) 
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SYSTEM  3 


DECISION  RULES  d  (p^  Pa) 


Constant  Decision 
Rules 


Ps 

The  lines  shown  indicate  the  boundaries 
which  divide  regions  requiring  a  different 
rule  of  response.  Each  line  is  labeled 
with  a)  the  message  requiring  the  differ¬ 
ential  response,  b)  the  optimal  response 
on  both  sides  of  the  line^ 


NOTE: 

A  •  Do  nothing 
o 

Engage  with  fighter 
A  :  Engage  with  missile 
:  Defined  on  page  27 


Figure  5.  Decision  Rules. 
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Altitude  Sensor  Error  Rate 


Figure  6.  System  Effectiveness,  E(Q) 
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SYSTEM  4 


DECISION  RULES  d  (p  ,  P  ) 

S  3. 


Constant  Decision 
Rules 


The  lines  shown  indicate  the  boundaries 
which  divide  regions  requiring  a  different 
rule  of  response.  Each  line  is  labeled 
with  a)  the  message  requiring  the  differ¬ 
ential  response,  b)  the  optimal  response 
on  both  sides  of  the  line« 

Figure  ?•  Decision  Rules, 


Message 

Y 


NOTE: 

A  : 

A°- 
Yj: 


Acti  on 


Do  nothing 
Engage  with  fighter 
Engage  with  missile 
Defined  on  page  27 
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Altitude  Sensor  Error  Rate 


SYSTEM  4 


Ps 

Speed  Sensor  Error  Rate 


ISOQUANT 


Figure  8.  System  Effectiveness,  E(Q) 
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SYSTEM  5 


DECISION  RULE  d{P^,p^) 


Ps 


Constant  Decision 
Rules 

Message  Action 


The  lines  shown  indicate  the  boundaries 
which  divide  regions  requiring  a  different 
rule  of  response.  Each  line  is  labeled 
with  a)  the  message  requiring  the  differ¬ 
ential  response,  b)  the  optimal  response 
on  both  sides  of  the  line. 


NOTE: 

A  : 
o 


Do  nothing 
Engage  with  fighter 
Engage  with  missile 
Defined  on  page  27 


Figure  9.  Decision  Rules 
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SYSTEM  5 


Speed  Sensor  Error  Rate 


Figure  10.  System  Effectiveness,  E(Q) 


© 


ISOQUANT 
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APPENDIX  I 


1.  AN  IDEALIZED  BOMB -DAMAGE -ASSESSMENT  COMMAND 
AND  CONTROL  SYSTEM 


1.  1  Description  of  the  Organization 


Assume  that  a  military  organization  is  engaged  in  the  activity  of 
bombing  a  set  of  n  enemy  targets.  After  the  initial  round  of  bombing, 
the  organization  must  make  decisions  on  how  to  allocate  the  remaining 
weapons  on  the  basis  of  the  information  it  has  concerning  the  world. 

We  begin  by  simplifying  the  problem  and  assuming  that  each  target  can 
only  be  in  one  of  two  states:  1)  target  is  still  operational,  2)  target  is 
out  of  commission.  Models  which  admit  of  a  greater  but  finite  number 
of  target  states  present  no  special  difficulty.  The  state  of  the  world  is 
then  a  vector  of  n  components,  one  for  each  initial  target.  The  entry 
is  zero  for  all  components  which  correspond  to  targets  which  have  been 
effectively  destroyed,  and  is  one  for  those  which  are  still  operational. 

A  bombing  action  can  thus  result  in  any  one  of  the  2^  distinct  Boolean 
vectors.  These  range  from  00 - 0  to  11 - 1  and  correspond,  re¬ 

spectively,  to  the  completely  successful  mission  and  to  the  completely 
unsuccessful  mission.  The  message  or  observation  vector  is  a  vector 
of,  at  most,  n  components  which  can  also  assume  the  values  zero  and 
one. 


The  command  and  control  system  for  the  bomb- damage-assess¬ 
ment  function  can  be  imperfect  on  one  or  both  of  two  distinct  counts. 

One,  the  message  or  observation  vector  may  be  incomplete,  i.  e.  ,  the 
message  vector  has  fewer  components  than  the  state  vector.  The  other, 
the  message  vector  may  have  an  error,  i.  e.  ,  there  may  be  a  non- zero 
probability  that,  for  at  least  some  of  the  elements,  the  message  and  the 
corresponding  state- of- the -world  element  do  not  coincide. 

The  i-th  target  is  characterized  by  a  threat  value  t^  and  a  value  Vj^. 
The  quantity  t^  is  negative  and  represents  the  risk  incurred  in  not  killing 
the  i^^  target.  The  quantity  v^  is  positive  and  represents  the  utility  of 
destroying  the  i-^h  target. 

The  weapons  used  by  the  organization  are  assumed  to  be  all  iden¬ 
tical  with  cost  C  and  a  probability  of  kill  (  1-q).  Thus,  q  is  the  probability 
of  failure. 


The  acts  open  to  the  organization  are  various  allocations  of  the 
available  weapons  on  the  n  potential  targets.  Thus,  the  act  vector  is  a 
vector  with  n  components,  the  i-th  component,  aj  ,  being  equal  to  the 
number  (integer)  of  weapons  allocated  against  the  i-th  target. 

With  the  above  assumption,  the  utility  which  we  can  associate 
with  an  action  state-of-the-world  pair  takes  the  form: 

n  3^. 

u(A,X)=  E  [  (1  -  q  X.  V.  +  q  ^  X.  t.  -  Ca.  ]  (I  1.1-1) 

i=  1 


1.  2  Operating  Constraints 

The  sets  of  acts  which  are  admissible  in  this  organization  are  those 
vectors  such  that: 


I'  A  <  R 
A  =  [A]  >  0 


(I  1.2-1) 


where  I  is  a  column  vector  of  n  ones,  and  [a]  is  the  vector  of  nearest 
integers.  The  first  constraint  expresses  the  fact  that  there  are  R  weapons 
available;  thus  an  admissible  act  should  not  utilize  more  than  R  weapons. 
The  second  expresses  the  fact  that  allocations  are  constituted  by  integer 
numbers  of  weapons  for  each  target  and  that  these  numbers  are  non-nega¬ 
tive.  The  set  of  A's  so  constrained  is  an  integral  simplex  of  **Size**  R  in 
n+  1  dimensions.  In  n  dimensional  space  it  is  the  convex  set  whose  ver¬ 
tices  are  the  origin  and  the  columns  (rows)  of  the  indentity  matrix  mul¬ 
tiplied  by  R. 


1.  3  The  Computation  of  V(x)  for  a  Generic  Bomb-Damage- 
Assessment  System 

Using  equations  (3.  3-1)  and  (  I  1.  1-1),  one  obtains  for  ^(x)  the 
following  expression: 


A 

V(X) 


M 

E  max 
i=l  A 


IN  .  i  \  ^ 

[  E  P(X‘^)  P(Y  Ix*^)  L 

j=l  k=:l 


i 
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r 

=  L  max  [  L  (1-q  V,  +  q  ^  T,  -  Ca  1  (II.  3-1) 

.  ,  ,  ,  '  k  k  k 


where 

N 

=  L  P(Y^|x^)  P(X-^)  xj. 

j=l 

yi  N  .  . 

=  L  P(Y^|X'^)  PCX'^) 

j=l 

yi  N  .  .  . 

C  =  L  PCY^IX'^)  PCX'^)  C  =  C  P(  Y^) 

j=l 


Consequently,  to  compute  the  value  of  V(x)  for  our  type  of  organiza¬ 
tion,  one  must  solve  the  following  type  of  nonlinear  mathematical  program¬ 
ming  problem: 


Max 

A 


{  L 


k=l 


[(1-q^ 


k 


+ 


^k 

q 


]} 


Subject  to 

n 

L  a  <  R  (a  =  positive  integer)  (.1  1.  3-2) 

iC  ““  iC 


Actually,  one  has  to  .solve  a  problem  of  the  type  (11.3-2)  for  each 
of  the  distinct  messages  Y^.  For  the  sake  of  simplicity  in  discussing  the 
solution  of  (  I  1.  3-2),  we  will  replace  and  CY^  with 

and  C,  respectively,  since  is  fixed  for  each  problem  (  I  1.  3-2).  It 
can  be  shown  (AppendixII)  that  the  Kuhn-Tucker  conditions  [2]  are  nec¬ 
essary  and  sufficient  for  a  problem  of  the  type  (I  1.3-2).  The  conditions 
are: 


For  some 


u^  >  0 

w^  >  0 


o 

-  w 


[I] 
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g  being  the  gradient  of  the  function  g  computed  at  the  optimal  point  (act) 
a°,  i.  e.  , 

g°  =  [Ag(a)]^^^o 


F  is  the  matrix  whose  rows  are  the  gradients  of  the  individual  constraint 
surfaces  of  (  I  1.  3-2)  computed  at  a^,  i.  e.  , 


F 


o 


Atj  (a)' 
A  £3  (a)' 


A£  (a)' 

m 


o 

a=a 


In  our  case,  the  e-th  component  of  g^  is: 


aO 

(T  -  V  )  log  q  q  ®  -  C 
e  e 


Ca^} 


a=a^ 


Also,  since  there  is  a  single  constraining  surface,  F^  has  a  single  row 
whose  e-th  component  is: 


a=a 


1 


Thus  [I  ]  in  our  case  is: 

(T^  -  V^)  log  q  q^l  -  C 
(T^  -  V^)  log  q  q^2  _  c 


o 

W  J 


w 


o 

2 


(T^  -  V^)  log  q  q^*"  -  C 

where  Uj  >  0  and  w°  >  0.  Furthermore, 
such  that  (a°)  =  0,  then  u°  >  0. 


since  the  optimal  point  will  be 
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Let  us  assume  to  begin  with  that  for  the  optimal  act  aj^  >  0  for  all  k*s. 
Then  all  the  w^  are  null  (Ref.  2  )  and  we  obtain  the  condition, 


(T^  -  V^)  log  q  q^l  -  C 

-1 

(T^  -  V2)  log  q  q  ^  -  C 

a„ 

+  U° 

-  1 

(T^  -  V^)  log  q  q  -  C 

-  1 

This  is  equivalent  to  saying  that  all  the  components  of  the  first  vector  are 
equal  to  each  other.  This  yields  n  -  1  equations, 

(T^  -  V^)  =  (T^  - 


<^1  -  -l"'  =  <T„  -  V  ■5^" 

th 

The  n  equation  needed  to  determine  the  Lagrangian  multiplier  is  supplied 
by  the  constraint,  i.  e.  , 

n 

S  a,  =  R 


A  little  algebra  reduces  the  above  to, 


which  has  as  a  solution, 


1 


a.  = 
1 


Tk  -  Vk 


£  log  ( 

"  k=l  ^  Ti-Vi 

k^i 


)  -  ^ 
n 


But,  since  log^  1=0  one  can  set 


or. 


a.  = 

1  n 


^i  =  n 


R 

a.  = 

1  n 


n 

£ 

log  ( 

1 

k=l 

q 

T.  -  V. 

n 

1  1 

n 

-  Tk 

R 

£ 

log  ( 

- -  ) 

+ 

k=l 

q 

V-  -  T. 

1  1 

n 

+ 

...SI 

h 

,  n,  (V 

k=l  ^ 

-Tk) 

V.  -  T. 
1  1 


That  is, 


-  R  X  1 
a.  -  —  +  log 

1  n  ^ 


G(n) 


^  V.  _  T. 


T' 
1  1 


where  G(  n)  is  the  geometric  mean  of  the  quantities  (Vj^  -  T^),  k=  1,  2, 

.  .  .  n.  If  on  the  other  hand  some  of  the  aj^'s  should  be  null,  then  the  corre¬ 
sponding  w^  are  going  to  be  non  zero  (Ref.  2  ).  Assume  that  (n  -  X  )  a^^'s 
are  going  to  be  null,  then  the  K  -  T  condition  would  read. 


(Tj  -  Vj)  log  q  -  C 

-  1 

0 

(T2  -  V2)  log  q  q^2  _  c 

-  1 

0 

(T\  -  VX  )  log  q  q  -  C 

+  u° 

-  1 

= 

0 

(TX+i  -  logq  -  C 

-  1 

0 

-w 

X+1 

(T^  -  V^)  log  q  -  C 

-  1 

0 

-  w 

n 

T  -6 


The  first  X  equations  together  with 


n 


L 

k=l 


R 


yield 


+ 


G(X) 

V  -  T. 
i  1 


where  i  =  1,  2,  ...  X  and 


CKX) 


(Vk 


Tk) 


The  remaining  equations  require 

(Ti  -  VJ  log  q  -  C  -  u°  =  -  w° 

i=X  +  1,  X  +  2,  ...  ,  n 

Since  w.  >  0  in  the  above  equation, 

(V.  -  T.)  log  -  -  C  -  u°  <  0 
i  i  ^  q  1 

That  is: 

(V.  -  T.)  <  ^1  ^ 

^  ^  log  1 

q 

i  =  X+1.  X+  2,  ...,  n 
The  first  X  equations,  on  the  other  hand,  yield: 
(T^  -  VJ  log  q  q  i  -  C  -  u°  = 


(I  1.  3-3) 


I  -7 


0 


i.  e.  , 

a-  ^  lop  G(^)  ^ 

(V.  -  T.)  q  J  =  (V.-T.)  qX  q  ^q  (y  -  T  )  =  G{X) 
J  J  J  J  j  j 


Thus,  if  the  i-th  target  should  be  allocated  no  weapons,  its  equivalent  value 
(V^  -  T^)  should  be  such  that 

R 

(V.  -  T.)  <  qX  G(X) 

We  are  thus  in  the  position  to  solve  any  program  of  the  type  (I  1.  3-2).  One 
would  start  by  computing  the  quantity 

R 

qX  G(X) 

with  X  equal  to  the  number  of  maximum  value  targets  and  determining  if 
there  are  any  i's  such  that, 

R 

(V.  -  T.)  >  qX  G{X) 

1  1 

If  one  finds  such  i*s,  one  would  add  them  to  the  set  of  X  targets  and  compute 
R 

qX  G{X) 

computed  over  the  so  obtained  set  of  targets.  This  procedure  is  iterated 
until  one  fails  to  add  any  further  targets,  then  utilizing  equation  (I  1.3-3) 
one  can  determine  the  optimal  allocation  of  weapons  [a.}  and  thus  eventually 
the  information  system  effectiveness- 


C  +  u° 
log  - 

q 
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2. 


SAMPLE  EVALUATIONS 


2.  1  The  Perfect  Information  System 


In  this  section  we  will  actually  compute  the  effectiveness  of  the  perfect 
information  system  for  the  organization  described  in  Section  I.  The  perfect 
information  system  is  represented  by  a  matrix,  P,  which  is  the  identity 
matrix. 


wherei 


is  the  Kroenecker  delta. 


This  simply  means  that  the  messages  and  states  of  the  world  are  in  a  one- 
to-one  correspondence,  i.  e.  ,  the  state  of  the  world  is  completely  and 
accurately  known. 


This  particular  information  system  is  considered  for  two  reasons: 
a)  it  establishes  an  upper  bound  for  the  effectiveness  of  all  possible  infor¬ 
mation  systems,  and  b)  it  poses  a  simple  evaluation  problem. 


2.  1.  I  The  Computation  of  V(Xo) 


We  begin  by  computing  V(Xq)  since  the  effectiveness  of  an 
information  system  E(x)  is  given  by  V(x)  -  V(Xq). 

Xq  is  the  system  characterized  by  the  matrix  Q=  {p(Y  Ix*^)} 
such  that  p(Y^|xj)  =  ^  .  Thus,  for  it  we  have: 


I 

N+  I 


P(xi)  4 


E(  Vj^ 
N+  1 


where  E(xj^)  is  the  expected  value  of  xj^. 
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If  the  prior  distribution  is  uniform,  E(xj^)  =  *2  >  since  the  k-th 
component  of  the  state  vector  is  equal  to  one  in  exactly  half  of  the  N+  1 
distinct  states  and  zero  for  the  other  half,  since  N  +  1  =  2^  is  even. 
Similarly, 

 E(xk)tk 


N+  1 


.Y' 


N  +  1 


Then  using  equation  (  1.3-1)  one  obtains: 


in  \  r  V  /I  E(xi^)vi.  E(xi^)  ti^ 

V(X^)  =  r  max  LI  (1  -  q  )  ^  ^  +  q  ^  ^  - 

i=0  A  k=l  N  +  1  N  +  1 


N  +  1  ^k  ^ 


M+  1 


n 


N  + 


p  max  Cl  (1  -q  ^)E(x^)  +  q  ^  E(x^)  tj^  -  Caj^]  (12.1.1-1) 

A  k=  1 


Assuming  that  all  vj^  and  tj^  are  equal,  that  the  number  M  +  1  of 
messages  and  the  number  N  +  1  of  states  are  equal,  and  that  the  prior  dis¬ 
tribution  is  uniform,  one  obtains 


n 


v(x)  =  max  [I  (l-q^)^  +  q^-|-  Ca^^  ] 
A  k=l 


(  I  2.  1.  1-2) 


With  the  usual  constraints 

I  ■  A  <  R 
[A]  >  0 

It  can  be  seen  that  the  solution  of  this  mathematical  program  is  given  by: 

log  -(26) 

log  q 


Use  the  smaller  of  the  two. 


R 

n 


where 


log  q  (v-t) 


I  -10 


In  the  first  case  one  has 


/V  r  V  1  -  log  ( -  2  6  )  -| 

v  u  =  n  [  -  +  C  - - -  ] 

o  2  log  q 


In  the  second  case  one  has 


R 


V(x^)  =  n  [(1  -  q  ^  )  -  + 


f  ^  - 


CR 


(I  2.  1.  1-3) 


(I  2.  1.  1-4) 


2.  1.  2  The  Computation  of  V(Xp) 


The  system  X^  is  the  system  characterized  by  the  matrix 
P  =  [p  (Y^X^)  ]  =  6ij 
and  thus  by  the  matrix 

Q  =  {p(yMxJ)}  =  6ij 

where  6  is  the  Kroenecker  delta.  Then  for  this  case  one  has  that: 


N 


v: 


j=o 

N 


r  6  p(xJ)  ^  =  p(x") 


r  6  p(xJ)  4  =  p(x‘)  x; 


j=0 

N 


r  6..p(xJ)  c  =  p(x")  c 


J 


1=0 


IJ 


Using  equation  (  I  1.  3-1)  one  obtains: 

M  n  a  .  .  a  .  . 

V(x  )  =  L  max  [  L  (  1-q  )  p  (X^)  x^  +  q  p  (X^)  ^  ^ 

^  i=0  A  k=l 


M=N  .  n  a  .  • 

:  L  p(X^)  max  [£(l-q  ^^k^ 

i=0  A  k=l 


(.1  2.  1.  2-1) 


In  the  event  that  all  v  's  and  t  's  are  equal,  this  reduces  to: 

k  k 


I-ll 


N 

V(x  )  =  r  P(X")  max 
^  i=0  A 


[  L  {(  1  -  q  )  v+  q 
k=l 


Ca^] 


(  I  2.  1.  2-2) 


The  solution  of  the  problem 


Max  t  jE  {( 1  -  q  V  +  q^^  t]  x^  -  Ca^^  ] 
A 


I  •  A  S  R 

[a]  1  0 


(  I  2.  1.  2-3) 


will  depend  on  one  characteristic  of  the  state  vector,  X^;  namely,  the 
number,  X,  of  its  non-zero  components.  (This  is  true  because  of  the 
assumed  homogeneity  of  the  targets.  ) 

If  0  is  the  set  of  non-null  components  of  X^,  the  unconstrained 
maximum  of  the  functional  is  given  by: 


where: 


log  -  6 

log  q 


0 


6  = 


c 

V-  t 


T -  .  The  a.  are  all  positive. 

log  q  k 


The  largest  X  for  which  the  unconstrained  maximum  is  the 
solution  of  the  problem  is  the  largest  integer  A  for  which 


n 

I' 

k=l 


^  R 


or 


A 


loR  -  6 

log  q 


R, 


1.  e.  , 


A  ^ 


R 


log  q 

log  (-6) 


Thus,  for  all  the  states  with  0  S  X  — 


I  -12 


max 


CE  {(l-q^v+q^t}x|^  -  Ca^ ]  = 

A  k=l 

log  -  6  log  -  6 

Xd-q*”*"*  )  v  +  Xq^°«'*  t-X  ci2£^ 

log  q 

X  [v+  C(— r°^  ^  )  3  (12.1.2-4) 

log  q 


If  X  >  A  maximum  is  constrained,  and  it  must  be 
obtained  by  using  the  K-T  condition  [l]  .  If  0  indicates  the  set  of  null 
components  of  X^,  we  can  begin  to  use  the  K-T  conditions  to  determine 
whether  any  weapons  have  to  be  allocated  against  the  corresponding 
targets.  First,  one  has  to  compute 

R 

q~^  G(X) 


where  k  C  0 

X  being,  as  indicated  above,  the  number  of  members  of  0  .  In  this  case. 


G (X  )  =  V  -  t 


Then,  if 


(V.  -  T.)  <  q  X  (v  -  t) 
J  J 


j 


one  would  allocate  no  weapons  against 

if  j  ^  (V  -  T.)  =  0,  and  since 

J  J  q 


the  targets  corresponding  to  6.  But 
(v  -  t)  is  positive  in  all  cases,  we  can 


conclude  that  the  constrained  optimum  will  require  allocating  no  weapons  to 
those  targets  which  do  not  exist,  which  of  course  is  an  expected  result.  The 
remaining  K-T  conditions  for  k  €0  require  that  the  projection  of  the  gradient 
of  the  functional  in  the  subspace  corresponding  to  the  indexing  set  6  be  parallel 
to  the  vector  of  all  ones,  which,  in  turn,  is  equivalent  to  requiring 


I  -13 


a 

(v  -  t)  log  q  q  e 


(v  -  t)  log  q 


C0 


I 

k€  e 


R 


These  are  solved  by 


a 


R 

X 


In  this  case  then 


max 

A 


n 

[L 


k=l 


{(1  -  q^)v  +  q 


Ca  ] 

k 


R  R 

^  [(1  -  q  ^  )  v+  q  t  -  c] 


(  I  2.  1.  2-5) 


The  expected  utility  of  the  perfect  information  transformation 
for  homogeneous  targets  can  be  computed  using  equations  (I  2.  1.  2-2), 

(I  2.  1.2-4),  and  (I  2.  1.2-5)  to  be: 


(x  )  =  2  n  (X)  X  [v  +  c(^  )  ] 


x=  0 


log  q 


n 


R 


L  n  (X)  X  [(1 

X  =  +1 


R 


q^)v+q^  t-C] 


V  (x 


)  =  [v+  c  ^  2  n(X)  X 


log  q 


X=  0 


n 

L 

X=  A  +  1 


R  R 

17  (X)  X  [  ( 1  -  q  "^  )  V  +  q"^  t  -  C] 


(I  2.  1.  2-6) 
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2.  1.  3  Sample  Results 


At  this  point,  subtracting  equation  (I  2.  1.  1-3)  or  (  I  2.  1.  1-4), 
whichever  applies,  from  equation  (I  2,  1.  2-6)  one  obtains 


€(x  )  =  V (x  )  -  V (x^) 

p  P  o 

Graphs  1-4  show  V  (x  )  and  V  (x  )  as  calculated  for  our  organization  using 
the  following  combinations  of  parameters  listed  below. 


#1 

#2 

#3 

#4 

No.  targets,  n 

4 

4 

4 

4 

Target  value,  v 

5000 

5000 

5000 

5000 

Target  threat,  t 

-5000 

-5000 

-5000 

-5000 

Weapon  cost,  C 

1000 

1000 

1 

1 

No.  avail,  weap.  ,  R 

16 

4 

100 

8 

Prior  distribution 

Uniform 

Uniform 

Uniform 

Uniform 

The  above  values  were  chosen  to  create  representative  cases.  As  it  can  be 
seen  in  all  cases,  the  organization  faces  up  to  4  targets,  all  of  equal  value 
and  threat  value.  The  cost  of  the  weapon  is  the  first  variable  in  the  table. 
Graphs  1  and  2  correspond  to  the  case  where  the  weapon  is  worth  1/10  of 
the  equivalent  value  of  the  target.  These  graphs,  therefore,  are  more  rele¬ 
vant  for  decision  making  involving  the  allocation  of  weapon  systems  rather 
than  single  weapons.  The  second  group  of  graphs,  on  the  other  hand 
(numbers  3  and  4),  with  a  weapon  cost  three  orders  of  magnitude  smaller, 
can  be  considered  relevant  for  decision  making  when  allocating  single 
weapons.  The  graphs  are  also  differentiated  on  the  basis  of  whether 
resources  are  abundant  or  scarce.  The  abundant  and  scarce  cases  are 
given  in  graphs  1,  3  and  2,  4,  respectively.  In  all  cases,  the  prior  distri¬ 
bution  is  uniform. 

Graph  3  represents  decision  making  involving  the  use  of 
cheap  resources  in  ample  supply.  It  is  clear  from  this  graph  that  there 
is  essentially  no  benefit  to  be  gained  from  adopting  a  bomb-damage- 
assessment  function. 
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Graph  4  shows  the  effect  of  a  limited  supply  for  a  cheap 
resource.  As  it  can  be  seen  the  effectiveness  of  the  information  system 
increases  with  a  decrease  in  the  reliability  of  the  weapon  employed.  The 
limited  supply  of  weapons  makes  it  impossible  to  saturate  the  targets,  thus 
demonstrating  the  desirability  of  having  bomb-damage  information  in  order 
to  use  one’s  limited  weapons  most  effectively. 

The  perfect  information  system  is  seen  to  be  capable  of 
bringing  about  a  maximum  of  80%  improvement  in  the  performance  of  the 
organization.  In  graph  1,  we  see  the  performance  of  the  organization  when 
its  decision  making  entails  the  use  of  expensive  resources,  such  as  entire 
weapon  systems,  which  are  in  abundant  supply.  In  graph  2,  the  limited 
supply  case,  the  improvement  of  performance  is  considerable,  and  one 
observes  that  the  effectiveness  of  the  perfect  information  system  attains  a 
maximum  for  weapon-kill  probabilities  in  the  range  .  6  to  .7,  More  repre¬ 
sentative  data  could  be  obtained  if  the  prior  state  distribution  used  could  be 
the  binomial  distribution  for  the  targets  previously  killed.  Such  curves 
could  be  obtained  if  the  above  procedure  were  computerized. 
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Expected  Utility 


PERFECT  INFORMATION 


6000 


3000 


0 


-3000 


6000 


3000 


0 


-3000 


Figure  I-  1. 


R  16 

C  .  1000 

p{X^)  Uniform 

V  5000 

t  -5000 

V(Pperfect)  - 

V(Pnull)  - 


R  4 

C  .  1000 

{p(X^)}  Uniform 

V  5000 

t  -5000 

V(Pperfect)  - 

V(Pnull)  - 
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Expe cted  Utility 


PERFECT  INFORMATION 
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R  100 

C  .  1 

{p(X^)}  Uniform 

V  5000 

t  -5000 

V(Pperfect)  - 

V(Pnull)  - 


R  8 

C  .  1 

{p(X^)}  Uniform 

V  5000 

t  -5000 

V(Pperfect)  - 

V(Pnull)  - 


APPENDIX  II 


THE  KUHN- TUCKER  CONDITIONS 


In  this  appendix  some  of  the  key  ideas  concerning  the  solution  of  the 
problem 


Fx  = 


(II- 1) 


£i  (x) 

fm(^) 


are  assumed  to  be  differentiable  functions  in  the  positive  orthant.  We  will 
begin  by  deriving  the  Kuhn-Tucker  conditions  as  the  necessary  conditions 
for  solving  the  mathematical  programming  problem  as  given  by  (II- 1). 


Fx  is  mapping  defined  by  the  m  differentiable  functions  f£(x),  i  =  1,2, 
.  .  .  m.  Consequently,  the  matrix 


6.fi 

6fi  .  . 

.  .  6fi 

6x1 

6x2 

6  Xn 

6f2 

6f2 

6f2 

6xj^ 

6x2 

6-n 

6’fm  .  . 

.  .  6fm 

dxj 

6x2 

=  F^ 


x=x 


II- 1 


exists  (F^  is  computed  at  x  =  x^),  and  so  does 


g°  =  (Ag(x))x=xO 


where  x°  satisfies  the  constraints  of  problem  (ll-l).  Incidentally,  F° 
can  be  written  also  as 

(Afi(x))' 

(Af2(x))' 

F°  = 

(Af^(x))' 


where  the  various  gradients  are  represented  as  column  vectors. 


In  particular,  if  we  assume  that  x®  is  on  the  boundary  of  the  set  Fx  >  0, 

X  >  0,  some  of  these  conditions  will  be  satisfied  at  the  equality  level.  Conse¬ 
quently,  we  may  partition  the  two  sets  of  conditions  in  four  sets  as  follows: 


Fx  >  0 


X  >  0 


C  F^x  >  0 


Fjx®  =  0 


corresponding  to 

F2x>0  corresponding  to  F2X0  >  0 

Vi 

corresponding  to  Ijx®  =  0 
^I2X  >  0  corresponding  to  ^2^^  —  ® 


I^x  >  0 


F]^x  is  the  mapping  obtained  from  Fx  by  striking  out  those  functions  f^(x) 
for  which 


f.(x°)  >  0 

F2X  is  simply  the  list  of  the  so  eliminated  functions.  and  are  matrices 
obtained  from  the  identity  matrix  by  collecting  the  rows  corresponding  to 
null  components  of  yP  and  non-null  components,  respectively. 

If  X®  is  a  maximum  for  problem  (ll~l),  then  it  must  be  true  that 

gO'  dx  <  0 


II-2 


o  ^ 

for  all  feasible  dx*s.  In  fact,  g  dx  =  (Ag)x=x^  *  along  dx.  The 

feasible  or  allowed  dx  are  those  which  do  not  cause  an  exit  from  the  feasibility 
region.  Now,  we  have  that 


dx  =  X  -  x^ 


and 


Fx  =  FxO  +  F°(x  -  x°) 


since  dx  =  x  -  x°  is  assumed  to  be  infinitesimal.  Consequently,  for  the  con- 
ditions  involving  the  F^x  part  of  the  mapping,  we  have 

FjX  =  Fjx^  +  Fj  (x  -  xO)  =  F^  (x  -  x^) 

since  by  definition  F  jX^  =  0.  Finally,  since  F  j  x  >  0,  it  follows  that 

F^  (x  =  x^)  =  Fi  dx  >  0,  Similarly,  the  x  >  0  constraint  yields  for  the 
displacement  dx  the  condition 

dx  0. 

Thus,  we  can  say  that  for  x^  to  be  a  solution  of  (II- 1)  it  is  necessary  that 


g° 

dx  < 

0 

for  all  dx 

(II-2) 

dx  > 

0 

ii 

dx  > 

0 

At  this  point  we  apply  the  following  theorem  due  to  Farkas: 
b*x  >  0  for  all  x*s  3  Ax  >  0  only  if 

b  =  A*t  for  some  t  >  0 

From  (II-2)  and  this  theorem,  one  obtains 


1 

1 

OQ 

0 

II 

ii 

It 

ii 

1 

II- 3 


(11-3) 


for  some  >  0,  >  0,  or 

-g^  =  +  I*^  for  some  >  0,  >  0 

and  adding  appropriate  zeros  to  and  to  form  and  w^,  one  can  write 
(B-3)  as 


-g^  =  F^  +  w^  for  some  ^  0,  w^  >  0 
Thus  we  can  conclude  that  the  condition 

gO  +  F^*  uO  =  -w®  wO  >  0,  u®  >  0 

where 


(II-3*) 


(II- 4) 


g°  =  (Ag(x))^^^o 


fl  (x)' 

h 


f 

m 


(x)* 


x=xO 


is  a  necessary  condition  for  x 


o 


to  be  a  solution  of 


^  Max  g(x) 


< 


(x)  >  0 
^2  (^)  ^  0 


(x)^  0 

X  >  0 

V 


(ii-i') 


where  the  f^  (x)  and  g(x)  are  required  to  be  differentiable  in  X  >  0.  In  our 
case  g(x)  is  (strictly)  concave  and  the  fi(x)  are  linear.  Let*s  prove  that 
under  these  circumstances  (II-3*)  is  sufficient  also.  From  Reference  [2] 
sufficient  condition  for  x^  to  be  a  solution  for  (II- 1)  is  that 


II-4 


(I) 

$°  <  0 

X 

«i>°'  x°  =  0 

X 

x°  >  0 

and 

(II) 

>  0 

U  “ 

<!'  u  =  0 

u 

u°  >  0 

and 

(III) 

p 

0 

<  4*  (x°,  u°)  + 

(x  -  xO) 

^>(x. 

u)  =  g(x)  +  u 

'Fx 

Furthermore,  and  are  the  gradients  of  (x,  u)  with  respect  to, 

respectively,  x  and  u  and  computed  at  the  (x^,  u^)  point.  We  will  begin  by 
showing  that  in  our  case  (III)  is  indeed  verified. 

The  assumption  that  g(x)  is  a  concave  function  is  equivalent  to  stating 

that 


g(x)  <  g(xO)  +  gO'(x  -  xO)  (II-5) 

The  assumption  that  the  fi(x)*s  are  linear  functions  is  equivalent  to  stating 
that 

f.(x)  =  f.(xo)  +  fi(xO)  (x  -  x^) 

and  thus  that 


Fx  =  Fx^  +  F^(x  -  X®) 

If  (11-6)  is  premultiplied  by  u°  and  added  to  (II-5),  one  obtains 
g(x)  +  u^  Fx  <  g  (x®)  +  u^*  Fx®  +  (g^  +  u^  F^)  (x  -  x^) 

Now  it  can  be  easily  verified  that 

=  g  +  F^  u 

Thus,  (II- 7)  is 

$  (x,  u® )  <  ^  (x^,  u^)  +  ^  (x  -  x^) 

X 

which  is  condition  III.  Since  Reference  2  shows  (I)  and  (II)  can  be  derived 
from  (11-3^),  this  establishes  the  sufficiency  of  (11-3^).  Thus,  to  state  (II-3*)  is 
in  our  case  equivalent  to  stating  that  x^  is  a  solution  of  the  mathematical  pro¬ 
gramming  problem  (II- 1). 


(IL6) 

(11-7) 


II- 5 


In  addition,  it  is  well  worth  noting  the  following  properties.  The  deriva¬ 
tion  of  (II-3*)  in  Reference  [2]  makes  it  clear  that  the  vector  has  zeros  in 
correspondence  to  those  fi(x)*s  which  enter  in  the  constraints  which  are  not 
satisfied  at  the  equality  level;  i.  e.  ,  for  each 

fi(x°)  >  0 

there  is  a  corresponding  zero  in  u^.  Similarly,  for  each 

X?  >  0 
1 

(x^  is  the  i^^  component  of  the  vector  x^)  there  is  a  null  component  in  the 
w^  vector. 
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